Spaces:
AIR-Bench
/
Running on CPU Upgrade

Commit History

refactor: refactor the benchmarks
3fcf957

nan commited on

fix: fix the format
5e03e4a

nan commited on

refactor: refactor the envs
4791ac5

nan commited on

refactor: refactor the envs
ba13e25

nan commited on

refactor: refactor the column settings
a7c0332

nan commited on

fix: add the missing files
2d272e2

nan commited on

refactor: refactor the benchmarks
649e0fb

nan commited on

refactor: move the data model
4eb64b4

nan commited on

refactor: remove the unnecessary variables
592bb62

nan commited on

refactor: move the column names to a seperated file
e2d3123

nan commited on

refactor: refactor the data loading part
2508d96

nan commited on

feat: refactor the data loading function
ebf3ceb

nan commited on

feat: use dataclass to manage the dataframes
59fa204

nan commited on

update about.py: add leaderboard backend link
49f004b

hanhainebula commited on

feat-replace-0-with-dash-0821 (#25)
716a3e3
verified

nan commited on

fix a bug in read_evals.py
439a031

hanhainebula commited on

fix a import bug in envs.py
7d7a455

hanhainebula commited on

chore: update the version
394f64e

nan commited on

feat-add-reranker-tab-0607 (#21)
64a83a1
verified

nan commited on

feat-add-meta-info-after-uploading-0607 (#20)
9d64883
verified

nan commited on

feat-switch-to-ndcg-for-qa-0607 (#19)
973bd2a
verified

nan commited on

feat-use-recall-as-default-metric-0605 (#18)
bbfe4c1
verified

nan commited on

feat-add-tabs-for-noreranker-0605 (#17)
b80bda9
verified

nan commited on

Merge branch 'main' of https://huggingface.co/spaces/AIR-Bench/leaderboard
c1df819

nan commited on

fix: fix the missing file
0646823

nan commited on

refactor-leaderboard-0605 (#16)
a0387d8
verified

nan commited on

fix-show-no-reranker-bug-0531 (#14)
dccb8fe
verified

nan commited on

fix a bug for anonymous retrieval model submisson
933b575
verified

hanhainebula commited on

fix-bug-in-selecting-domain-0522 (#13)
eb0f9c5
verified

nan commited on

fix a bug in METRIC_LIST
443f557

hanhainebula commited on

modify the default value of reranking selector
2a65b26

hanhainebula commited on

modify the default value of reranking selector
658d5a4

hanhainebula commited on

feat-convert-reranker-selection-to-dropdown-0521 (#12)
1f30550
verified

nan commited on

feat-rename-to-retrieval-methods-0520 (#11)
f000c74
verified

nan commited on

Fix bug in dataset_dict: "gpt-3" -> "gpt3"
8102fce
verified

hanhainebula commited on

Fix bug in dataset_dict: "health" -> "healthcare"
4a44211
verified

hanhainebula commited on

feat-improve-submission-page-0517 (#10)
e93da78
verified

nan commited on

fix-bug-in-show-details-0517 (#9)
7ca7624
verified

nan commited on

feat-add-benchmark-version-selector-0515 (#7)
2c23c13
verified

nan commited on

feat-add-no-reranker-button-0515 (#5)
9002757
verified

nan commited on

feat-add-reranker-url-validator-0515 (#4)
240d9ce
verified

nan commited on