Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 2 items • Updated about 2 hours ago • 4
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 3 days ago • 46
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 9 days ago • 8
SwiftKV Models Collection SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. • 4 items • Updated 3 days ago • 5
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 18 days ago • 87
Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated 9 days ago • 30
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 18 days ago • 248
Riva Collection A family of Riva production (NVAIE) speech models that achieve state-of-the-art results on speech transcription, translation, and synthesis tasks. • 1 item • Updated 9 days ago • 3
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 28 days ago • 11