Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

This is the largest collection of Persian models available on Huggingface

about 8 hours ago

Running

37

🥇

Leaderboard
bolbolzaban/gpt2-persian

Text Generation • Updated May 21, 2021 • 814 • 27
jonatasgrosman/wav2vec2-large-xlsr-53-persian

Automatic Speech Recognition • Updated Dec 14, 2022 • 289k • 21
m3hrdadfi/wav2vec2-large-xlsr-persian

Automatic Speech Recognition • Updated Nov 4, 2021 • 320 • 16

mlx-community/Mistral-Small-24B-Instruct-2501-4bit

Updated 3 days ago • 272 • 6
mlx-community/Mistral-Small-24B-Instruct-2501-3bit

Updated 3 days ago • 44 • 1
mlx-community/Mistral-Small-24B-Instruct-2501-6bit

Updated 3 days ago • 68
mlx-community/Mistral-Small-24B-Instruct-2501-8bit

Updated 3 days ago • 118

mlx-community/Qwen2.5-VL-72B-Instruct-4bit

Image-Text-to-Text • Updated 4 days ago • 176 • 2
mlx-community/Qwen2.5-VL-72B-Instruct-3bit

Image-Text-to-Text • Updated 4 days ago • 79 • 2
mlx-community/Qwen2.5-VL-7B-Instruct-bf16

Image-Text-to-Text • Updated 4 days ago • 226 • 2
mlx-community/Qwen2.5-VL-7B-Instruct-8bit

Image-Text-to-Text • Updated 4 days ago • 499 • 7

2025 January Papers 🧐

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 19 days ago • 271
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 11 days ago • 281
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 99
The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 20 days ago • 89

SFTvsRL Models & Data

tianzhechu/GP-VL-Init

Updated 5 days ago • 8
tianzhechu/GP-L-Init

Updated 6 days ago • 10
tianzhechu/VIRL-L-Init

Updated 4 days ago • 7 • 1
tianzhechu/VIRL-VL-Init

Updated 4 days ago • 4

mlx-community/Qwen2.5-7B-Instruct-1M-4bit

Text Generation • Updated 7 days ago • 363 • 6
mlx-community/Qwen2.5-7B-Instruct-1M-6bit

Text Generation • Updated 7 days ago • 43 • 1
mlx-community/Qwen2.5-7B-Instruct-1M-3bit

Text Generation • Updated 7 days ago • 23
mlx-community/Qwen2.5-7B-Instruct-1M-8bit

Text Generation • Updated 7 days ago • 65

Phi-4 (All Versions)

Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes.

unsloth/phi-4-GGUF

Text Generation • Updated 20 days ago • 70.7k • 132
unsloth/phi-4-unsloth-bnb-4bit

Text Generation • Updated 20 days ago • 61.7k • 37
unsloth/phi-4

Text Generation • Updated 20 days ago • 20.4k • 68
unsloth/phi-4-bnb-4bit

Text Generation • Updated 20 days ago • 3.66k • 12

deepseek-ai/deepseek-vl2-tiny

Image-Text-to-Text • Updated Dec 18, 2024 • 29.1k • 90
deepseek-ai/deepseek-vl2-small

Image-Text-to-Text • Updated Dec 18, 2024 • 8.13k • 51
deepseek-ai/deepseek-vl2

Image-Text-to-Text • Updated Dec 18, 2024 • 4.32k • 166
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 12

Meta's Llama 3.2 language models & evals

meta-llama/Llama-3.2-1B

Text Generation • Updated Oct 24, 2024 • 1.44M • 1.52k
meta-llama/Llama-3.2-1B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.45M • • 736
meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.5M • • 946
meta-llama/Llama-3.2-3B

Text Generation • Updated Oct 24, 2024 • 343k • 484

The collection of Cosmos models

nvidia/Cosmos-1.0-Guardrail

Updated 24 days ago • 6.6k • 42
nvidia/Cosmos-1.0-Autoregressive-4B

Updated 24 days ago • 2.35k • 46

Previous
1
...
4
5
6
7
8
...
9,099
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs