Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 12 days ago • 284
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 20 days ago • 132
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 12 days ago • 64
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 26 days ago • 49
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 18 days ago • 37
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published Dec 13, 2024 • 12