🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 100
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published about 1 month ago • 51