Running on CPU Upgrade 78 78 Open Japanese LLM Leaderboard 🌸 Explore and compare LLM models through interactive leaderboards and submissions
future-technologies/Universal-Transformers-Dataset Viewer • Updated about 7 hours ago • 1.55M • 374 • 21
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published Feb 11 • 53
multilingual_domain_datasets Collection Multilingual datasets. Excluding those which are just a cleaned version of CC. • 3 items • Updated Feb 17
multilingual_domain_datasets Collection Multilingual datasets. Excluding those which are just a cleaned version of CC. • 3 items • Updated Feb 17
multilingual_benchmark Collection For evaluating multilingual ability of LLMs • 1 item • Updated Feb 13