Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? Paper • 2504.06514 • Published 7 days ago • 32
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 8 days ago • 77
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 8 days ago • 158
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published 12 days ago • 74
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published 20 days ago • 39
Gemma 3 QAT Collection Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The model preserves quality similar to half precision while using 3x less memory • 8 items • Updated 13 days ago • 116
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper • 2504.01848 • Published 13 days ago • 34
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published 14 days ago • 59