Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Paper • 2502.14846 • Published about 18 hours ago • 6
Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published 3 days ago • 34
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 4 days ago • 80
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 8 days ago • 180
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Paper • 2502.07374 • Published 10 days ago • 31
view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 11 days ago • 35
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • 10 days ago • 22
view article Article Topic 27: What are Chain-of-Agents and Chain-of-RAG? By Kseniase and 1 other • 8 days ago • 10
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 10 days ago • 48
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 11 days ago • 132
Expect the Unexpected: FailSafe Long Context QA for Finance Paper • 2502.06329 • Published 11 days ago • 122
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 18 days ago • 9
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 17 days ago • 187
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 29 days ago • 63