Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published 4 days ago • 22
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published 11 days ago • 41
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 11 days ago • 64
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 12 days ago • 284
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 13 days ago • 48
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published 17 days ago • 42