Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
eipi1-0
's Collections
Awesome tiny LLMs
LM Preference datas
Remarkable LLMs
Code datas
LM datas
LM SFT datas
LLM RL papers
LLM Leaderboards
LLM Benchmark datas
LLM Benchmark datas
updated
Oct 25
Upvote
-
truthfulqa/truthful_qa
Viewer
•
Updated
Jan 4
•
1.63k
•
28.2k
•
209
tasksource/bigbench
Updated
May 11, 2023
•
4.94k
•
62
THUDM/LongBench
Viewer
•
Updated
8 days ago
•
8.42k
•
40.9k
•
131
tau/scrolls
Updated
Jan 12
•
2.84k
•
26
gsarti/flores_101
Viewer
•
Updated
Oct 27, 2022
•
207k
•
5.73k
•
23
nguha/legalbench
Updated
Sep 30
•
9.93k
•
97
google/bigbench
Updated
Jan 18
•
326
•
56
GEM/gem
Updated
Jan 18
•
4.83k
•
30
m-a-p/CMMMU
Viewer
•
Updated
Sep 5
•
12k
•
278
•
29
openai/gsm8k
Viewer
•
Updated
Jan 4
•
17.6k
•
168k
•
458
codeparrot/apps
Viewer
•
Updated
Oct 20, 2022
•
20k
•
4.98k
•
132
froggeric/creativity
Viewer
•
Updated
May 28
•
48
•
78
•
84
openai/openai_humaneval
Viewer
•
Updated
Jan 4
•
164
•
84.6k
•
255
Idavidrein/gpqa
Viewer
•
Updated
Mar 28
•
1.25k
•
17.8k
•
87
google/IFEval
Viewer
•
Updated
Aug 14
•
541
•
7.18k
•
44
TIGER-Lab/MMLU-Pro
Viewer
•
Updated
29 days ago
•
12.1k
•
38.7k
•
302
TAUR-Lab/MuSR
Viewer
•
Updated
May 21
•
756
•
8.12k
•
11
lighteval/MATH-Hard
Viewer
•
Updated
Jun 12
•
7.26k
•
8.65k
•
17
wis-k/instruction-following-eval
Viewer
•
Updated
Dec 5, 2023
•
541
•
9.67k
•
4
SaylorTwift/bbh
Viewer
•
Updated
Jun 16
•
6.76k
•
8.77k
•
3
m-a-p/CodeEditorBench
Preview
•
Updated
Apr 5
•
189
•
19
m-a-p/CHC-Bench
Viewer
•
Updated
Apr 8
•
214
•
331
•
8
openai/MMMLU
Viewer
•
Updated
Oct 16
•
393k
•
1.79k
•
438
lighteval/MATH
Viewer
•
Updated
Oct 17, 2023
•
25k
•
9.2k
•
61
cais/mmlu
Viewer
•
Updated
Mar 8
•
231k
•
113k
•
348
hails/agieval-logiqa-zh
Viewer
•
Updated
Jan 26
•
651
•
491
•
3
Upvote
-
Share collection
View history
Collection guide
Browse collections