Multilingual MT-Bench LGAI-EXAONE/KoMT-Bench Viewer • Updated Aug 8, 2024 • 80 • 148 • 40 StudentLLM/Korean_MT-Bench_questions Viewer • Updated Jul 10, 2023 • 80 • 4 • 1 naive-puzzle/japanese-mt-bench Viewer • Updated Aug 2, 2024 • 380 • 32 • 1 karakuri-ai/corrected-mt-bench-ja Viewer • Updated Jul 11, 2024 • 80 • 102 • 1
Foundation Corpus HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 306k • 1.01k mlfoundations/dclm-baseline-1.0 Preview • Updated Jul 22, 2024 • 139k • 262 Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 261k • 90 LLM360/TxT360 Updated May 26, 2025 • 47.1k • 248
Benchmark openai/MMMLU Viewer • Updated Oct 16, 2024 • 393k • 7.13k • 518 google/IFEval Viewer • Updated Aug 14, 2024 • 541 • 83.2k • 142
Reward Datasets nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 3.79k • 442 Skywork/Skywork-Reward-Preference-80K-v0.1 Viewer • Updated Oct 25, 2024 • 82k • 106 • 47
Math MathGenie/MathCode-Pile Viewer • Updated Oct 16, 2024 • 719k • 444 • 25 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 8.64k • 301
benchmark-target tasksource/tasksource_dpo_pairs Viewer • Updated Jul 1, 2024 • 5.13M • 88 • 21 euclaise/SuperMC Viewer • Updated Jan 25, 2024 • 278k • 13 • 2 argilla/ifeval-like-data Viewer • Updated Oct 17, 2024 • 606k • 660 • 47
Benchmark-Korean skt/kobest_v1 Viewer • Updated Mar 28, 2024 • 23.4k • 2.51k • 54 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 5.81k • 96 HAERAE-HUB/KMMLU-HARD Viewer • Updated Mar 9, 2024 • 4.33k • 902 • 13 maywell/LogicKor Preview • Updated Jun 9, 2024 • 49 • 21
Korean Datasets [Good] HAERAE-HUB/Korean-Human-Judgements Viewer • Updated Jun 30, 2024 • 694 • 28 • 38
Multilingual MT-Bench LGAI-EXAONE/KoMT-Bench Viewer • Updated Aug 8, 2024 • 80 • 148 • 40 StudentLLM/Korean_MT-Bench_questions Viewer • Updated Jul 10, 2023 • 80 • 4 • 1 naive-puzzle/japanese-mt-bench Viewer • Updated Aug 2, 2024 • 380 • 32 • 1 karakuri-ai/corrected-mt-bench-ja Viewer • Updated Jul 11, 2024 • 80 • 102 • 1
Math MathGenie/MathCode-Pile Viewer • Updated Oct 16, 2024 • 719k • 444 • 25 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 8.64k • 301
Foundation Corpus HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 306k • 1.01k mlfoundations/dclm-baseline-1.0 Preview • Updated Jul 22, 2024 • 139k • 262 Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 261k • 90 LLM360/TxT360 Updated May 26, 2025 • 47.1k • 248
benchmark-target tasksource/tasksource_dpo_pairs Viewer • Updated Jul 1, 2024 • 5.13M • 88 • 21 euclaise/SuperMC Viewer • Updated Jan 25, 2024 • 278k • 13 • 2 argilla/ifeval-like-data Viewer • Updated Oct 17, 2024 • 606k • 660 • 47
Benchmark openai/MMMLU Viewer • Updated Oct 16, 2024 • 393k • 7.13k • 518 google/IFEval Viewer • Updated Aug 14, 2024 • 541 • 83.2k • 142
Benchmark-Korean skt/kobest_v1 Viewer • Updated Mar 28, 2024 • 23.4k • 2.51k • 54 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 5.81k • 96 HAERAE-HUB/KMMLU-HARD Viewer • Updated Mar 9, 2024 • 4.33k • 902 • 13 maywell/LogicKor Preview • Updated Jun 9, 2024 • 49 • 21
Reward Datasets nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 3.79k • 442 Skywork/Skywork-Reward-Preference-80K-v0.1 Viewer • Updated Oct 25, 2024 • 82k • 106 • 47
Korean Datasets [Good] HAERAE-HUB/Korean-Human-Judgements Viewer • Updated Jun 30, 2024 • 694 • 28 • 38