Wikipedia in the Era of LLMs: Evolution and Risks Paper • 2503.02879 • Published 2 days ago • 18
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 14 days ago • 91
Running 2.09k 2.09k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Demystifying Long Chain-of-Thought Reasoning in LLMs Paper • 2502.03373 • Published 29 days ago • 55
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published Feb 3 • 112
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 185
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 274
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 260
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Paper • 2412.03548 • Published Dec 4, 2024 • 17