Commit 66b17ba
Author: Gregor Betz
Parent(s): 3390c86

styling

src/display/about.py +1 -1
@@ -69,7 +69,7 @@ Unlike these leaderboards, the `/\/` Open CoT Leaderboard assess a model's abili
 
 ## Test dataset selection (`tasks`)
 
-The test dataset porblems in the CoT Leaderboard can be solved through clear thinking alone, no specific knowledge is required to do so. They are subsets of the [AGIEval benchmark](https://github.com/ruixiangcui/AGIEval) and re-published as `
+The test dataset porblems in the CoT Leaderboard can be solved through clear thinking alone, no specific knowledge is required to do so. They are subsets of the [AGIEval benchmark](https://github.com/ruixiangcui/AGIEval) and re-published as [`logikon-bench`](logikon/logikon-bench). The `logiqa` dataset has been newly translated from Chinese to English.
 
 
 ## Reproducibility