Hugging Face
Evaluation datasets
community
Team members: 7
Models: none public yet
Datasets: 67
lighteval/hellaswag_thai • Viewer • Updated 9 days ago • 25.6k
lighteval/ChineseSquad • Viewer • Updated Aug 3 • 76.4k
lighteval/thaiqa_squad_fixed • Viewer • Updated Aug 1 • 4.07k
lighteval/KenSwQuAD • Viewer • Updated Aug 1 • 7.5k
lighteval/MATH-Hard • Viewer • Updated Jun 12 • 7.26k • 844k • 12
lighteval/aimo_progress_prize_1 • Viewer • Updated Apr 10 • 10 • 6
lighteval/mt-bench • Viewer • Updated Mar 19 • 80 • 220 • 1
lighteval/bbh • Updated Jan 31 • 12k • 1
lighteval/big_bench_hard • Viewer • Updated Oct 17, 2023 • 6.26k • 1.38k • 3
lighteval/MATH • Viewer • Updated Oct 17, 2023 • 25k • 69.4k • 47