Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 60
nvidia/Llama-Nemotron-Post-Training-Dataset-v1 Viewer • Updated 19 days ago • 15.2M • 12.8k • 327
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 5 days ago • 118k • 1.06k
Running 2.41k 2.41k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge Paper • 2411.16594 • Published Nov 25, 2024 • 40
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 62