Gregor Betz
commited on
Commit
•
3390c86
1
Parent(s):
3e94412
about datasets
Browse files- src/display/about.py +1 -1
src/display/about.py
CHANGED
@@ -69,7 +69,7 @@ Unlike these leaderboards, the `/\/` Open CoT Leaderboard assess a model's abili
|
|
69 |
|
70 |
## Test dataset selection (`tasks`)
|
71 |
|
72 |
-
The test dataset porblems in the CoT Leaderboard can be solved through clear thinking alone, no specific knowledge is required to do so. They are subsets of the AGIEval benchmark and re-published as `logikon-bench`. The `logiqa` dataset has been newly translated from Chinese to English.
|
73 |
|
74 |
|
75 |
## Reproducibility
|
|
|
69 |
|
70 |
## Test dataset selection (`tasks`)
|
71 |
|
72 |
+
The test dataset porblems in the CoT Leaderboard can be solved through clear thinking alone, no specific knowledge is required to do so. They are subsets of the [AGIEval benchmark](https://github.com/ruixiangcui/AGIEval) and re-published as `[logikon-bench](logikon/logikon-bench)`. The `logiqa` dataset has been newly translated from Chinese to English.
|
73 |
|
74 |
|
75 |
## Reproducibility
|