DeepSeek-R1 Collection by deepseek-ai 2 days ago 94 deepseek-ai/DeepSeek-R1 Text Generation • Updated about 1 hour ago • 44.6k • 1.87k deepseek-ai/DeepSeek-R1-Zero Text Generation • Updated 44 minutes ago • 3.04k • 359 deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated about 1 hour ago • 10.5k • 190 deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 1 hour ago • 63.7k • • 370
DeepSeek R1 (All Versions) DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. Collection by unsloth 2 days ago 53 unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Updated 2 days ago • 37.4k • 85 unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF Updated 3 days ago • 21.8k • 47 unsloth/DeepSeek-R1-Distill-Qwen-14B-GGUF Updated 3 days ago • 14.7k • 31 unsloth/DeepSeek-R1-Distill-Qwen-7B-GGUF Updated 3 days ago • 14.2k • 39
Cosmos The collection of Cosmos models Collection by nvidia 6 days ago 245 nvidia/Cosmos-1.0-Guardrail Updated 13 days ago • 5.44k • 41 nvidia/Cosmos-1.0-Autoregressive-4B Updated 13 days ago • 2.11k • 46
DeepSeek-V3 Collection by deepseek-ai 17 days ago 130 deepseek-ai/DeepSeek-V3-Base Updated 24 days ago • 19.8k • 1.3k deepseek-ai/DeepSeek-V3 Updated 24 days ago • 200k • 2.19k DeepSeek-V3 Technical Report Paper • 2412.19437 • Published 27 days ago • 27
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen Nov 28, 2024 472 Running 613 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated Sep 25, 2024 • 337k • • 162 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated Sep 25, 2024 • 581k • • 190 Qwen/Qwen2.5-1.5B Text Generation • Updated Oct 8, 2024 • 91.2k • 56
GTE ModernBERT GTE Models Based on ModernBERT Collection by Alibaba-NLP 2 days ago 10 Alibaba-NLP/gte-modernbert-base Sentence Similarity • Updated about 19 hours ago • 519 • 50 Alibaba-NLP/gte-reranker-modernbert-base Sentence Similarity • Updated about 2 hours ago • 219 • 29
Jan 17 Releases ❄️ Models and datasets of the second week of Jan 2025. Collection by merve 6 days ago 10 openbmb/MiniCPM-o-2_6 Any-to-Any • Updated about 2 hours ago • 50.8k • 773 MiniMaxAI/MiniMax-Text-01 Text Generation • Updated 6 days ago • 3.94k • 463 OuteAI/OuteTTS-0.3-1B Text-to-Speech • Updated 6 days ago • 8.11k • 79 NovaSky-AI/Sky-T1_data_17k Viewer • Updated 9 days ago • 16.4k • 2.71k • 138
Meta's Llama 3.2 language models & evals Collection by meta-llama Dec 13, 2024 48 meta-llama/Llama-3.2-1B Text Generation • Updated Oct 24, 2024 • 1.26M • • 1.48k meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated Oct 24, 2024 • 1.15M • • 715 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated Oct 24, 2024 • 1.51M • • 916 meta-llama/Llama-3.2-3B Text Generation • Updated Oct 24, 2024 • 427k • • 466
Phi-4 (All Versions) Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. Collection by unsloth 3 days ago 35 unsloth/phi-4-GGUF Text Generation • Updated 10 days ago • 61.2k • 121 unsloth/phi-4-unsloth-bnb-4bit Text Generation • Updated 10 days ago • 41.5k • 32 unsloth/phi-4 Text Generation • Updated 10 days ago • 14.7k • 65 unsloth/phi-4-bnb-4bit Text Generation • Updated 10 days ago • 3.02k • 10
Q-Series Sketch Q(n) Collection by strangerzonehf 3 days ago 7 strangerzonehf/Qx-Art Text-to-Image • Updated 3 days ago • 68 • • 9 strangerzonehf/Qw-Sketch Text-to-Image • Updated 3 days ago • 47 • • 9 strangerzonehf/Qd-Sketch Text-to-Image • Updated 3 days ago • 341 • • 12 strangerzonehf/Qs-Sketch Text-to-Image • Updated 3 days ago • 36 • • 9