DeepSeek-R1 Collection by deepseek-ai 2 days ago 92 deepseek-ai/DeepSeek-R1 Text Generation • Updated about 18 hours ago • 20.1k • 1.81k deepseek-ai/DeepSeek-R1-Zero Text Generation • Updated about 18 hours ago • 1.61k • 353 deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated about 18 hours ago • 6.17k • 188 deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 18 hours ago • 50.7k • • 364
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated about 18 hours ago • 50.7k • • 364
DeepSeek R1 (All Versions) DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. Collection by unsloth 2 days ago 52 unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Updated 2 days ago • 27.8k • 80 unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF Updated 2 days ago • 18.9k • 45 unsloth/DeepSeek-R1-Distill-Qwen-14B-GGUF Updated 3 days ago • 11.8k • 30 unsloth/DeepSeek-R1-Distill-Qwen-7B-GGUF Updated 3 days ago • 10.6k • 36
Cosmos The collection of Cosmos models Collection by nvidia 6 days ago 244 nvidia/Cosmos-1.0-Guardrail Updated 13 days ago • 5.18k • 41 nvidia/Cosmos-1.0-Autoregressive-4B Updated 13 days ago • 2.04k • 46
DeepSeek-V3 Collection by deepseek-ai 17 days ago 128 deepseek-ai/DeepSeek-V3-Base Updated 24 days ago • 19.1k • 1.3k deepseek-ai/DeepSeek-V3 Updated 24 days ago • 185k • 2.18k DeepSeek-V3 Technical Report Paper • 2412.19437 • Published 27 days ago • 27
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen Nov 28, 2024 472 Running 613 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated Sep 25, 2024 • 328k • 162 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated Sep 25, 2024 • 570k • 190 Qwen/Qwen2.5-1.5B Text Generation • Updated Oct 8, 2024 • 71k • 56
GTE ModernBERT GTE Models Based on ModernBERT Collection by Alibaba-NLP 2 days ago 10 Alibaba-NLP/gte-modernbert-base Sentence Similarity • Updated about 14 hours ago • 61 • 43 Alibaba-NLP/gte-reranker-modernbert-base Sentence Similarity • Updated about 16 hours ago • 36 • 23
Jan 17 Releases ❄️ Models and datasets of the second week of Jan 2025. Collection by merve 6 days ago 10 openbmb/MiniCPM-o-2_6 Any-to-Any • Updated about 14 hours ago • 39.2k • 764 MiniMaxAI/MiniMax-Text-01 Text Generation • Updated 6 days ago • 3.6k • 462 OuteAI/OuteTTS-0.3-1B Text-to-Speech • Updated 6 days ago • 7.28k • 79 NovaSky-AI/Sky-T1_data_17k Viewer • Updated 9 days ago • 16.4k • 2.71k • 138
Meta's Llama 3.2 language models & evals Collection by meta-llama Dec 13, 2024 47 meta-llama/Llama-3.2-1B Text Generation • Updated Oct 24, 2024 • 1.24M • 1.48k meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated Oct 24, 2024 • 1.1M • • 715 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated Oct 24, 2024 • 1.49M • • 916 meta-llama/Llama-3.2-3B Text Generation • Updated Oct 24, 2024 • 440k • 465
Phi-4 (All Versions) Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. Collection by unsloth 3 days ago 35 unsloth/phi-4-GGUF Text Generation • Updated 9 days ago • 56.4k • 119 unsloth/phi-4-unsloth-bnb-4bit Text Generation • Updated 9 days ago • 39.3k • 32 unsloth/phi-4 Text Generation • Updated 9 days ago • 14.1k • 65 unsloth/phi-4-bnb-4bit Text Generation • Updated 9 days ago • 2.96k • 10
Q-Series Sketch Q(n) Collection by strangerzonehf 3 days ago 7 strangerzonehf/Qx-Art Text-to-Image • Updated 2 days ago • 56 • • 9 strangerzonehf/Qw-Sketch Text-to-Image • Updated 2 days ago • 38 • • 9 strangerzonehf/Qd-Sketch Text-to-Image • Updated 2 days ago • 259 • • 12 strangerzonehf/Qs-Sketch Text-to-Image • Updated 2 days ago • 16 • • 9