Access Sets Matter: Budgeting Expert Reads for Scalable Weight-Space Model Merging Paper • 2605.29489 • Published 8 days ago • 4
Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation Paper • 2605.26844 • Published 10 days ago • 25
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published 26 days ago • 52
InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities Paper • 2508.05496 • Published Aug 7, 2025 • 9
Running 3.87k The Ultra-Scale Playbook 🌌 3.87k The ultimate guide to training LLM on large GPU Clusters