SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 10 days ago • 162
Towards Retrieval Augmented Generation over Large Video Libraries Paper • 2406.14938 • Published Jun 21, 2024 • 21
π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Article • 11 days ago • 96
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial Article • By open-r1 • 15 days ago • 34
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 23 days ago • 318
Yay! Organizations can now publish blog Articles Article • By huggingface and 3 others • 25 days ago • 33