Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

260

Full-text search

Active filters: llama.cpp

TomoDG/EtherealAurora-MN-Nemo-12B-GGUF

Text Generation • 12B • Updated Apr 27 • 96

heyIamUmair/llama-3.2-3b-merged-gguf

3B • Updated Apr 27 • 9

jedisct1/MiMo-7B-RL-GGUF

8B • Updated Apr 30 • 429 • 23

Baskar2005/deepseek_Sunfall_Merged_Model

8B • Updated May 8 • 16

Baskar2005/deepseek_sunfall_merged_model_GGUF

8B • Updated May 8 • 3

RDson/Qwen3-30B-A3B-By-Expert-Quantization-GGUF

31B • Updated May 9 • 26 • 1

sychonix/OlympicCoder-7B-Sychonix

8B • Updated May 14 • 1 • 1

tifin-india/sarvam-m-24b-q6-k-gguf

Text Generation • 24B • Updated May 24 • 7 • 1

tifin-india/sarvam-m-24b-q5-1-gguf

Text Generation • 24B • Updated May 24 • 8

tifin-india/sarvam-m-24b-q2-k-gguf

Text Generation • 24B • Updated May 24 • 6

tifin-india/sarvam-m-24b-f16-gguf

Text Generation • 24B • Updated May 24 • 5

tifin-india/sarvam-m-24b-q3-k-l-gguf

Text Generation • 24B • Updated May 24 • 8

tifin-india/sarvam-m-24b-q3-k-s-gguf

Text Generation • 24B • Updated May 24 • 4

tifin-india/sarvam-m-24b-q3-k-gguf

Text Generation • 24B • Updated May 24 • 6

tifin-india/sarvam-m-24b-q4-k-m-gguf

Text Generation • 24B • Updated May 24 • 8 • 1

tifin-india/sarvam-m-24b-q3-k-m-gguf

Text Generation • 24B • Updated May 24 • 5

tifin-india/sarvam-m-24b-q4-k-s-gguf

Text Generation • 24B • Updated May 24 • 4

tifin-india/sarvam-m-24b-q5-k-m-gguf

Text Generation • 24B • Updated May 24 • 23 • 2

ykarout/MiMo-VL-7B-SFT-GGUF

Image-Text-to-Text • 8B • Updated Jun 2 • 16

XythicK/Qwen.Qwen2.5-Math-1.5B-GGUF

2B • Updated Jun 5 • 57

Govind222/Koyna-V2-1b-instruct-GGUF

1.0B • Updated Jun 5

agentlans/SmolLM2-135M-Instruct-GGUF

0.1B • Updated Jun 6 • 17

ReallyFloppyPenguin/Holo1-3B-GGUF

3B • Updated Jun 10 • 74 • 2

mgonzs13/SpaceOm-GGUF

Image-Text-to-Text • 3B • Updated 28 days ago • 88 • 1

Darkhn/L3.3-70B-Animus-V1-GGUF

71B • Updated Jun 16 • 182

allura-quants/allura-org_Q3-8B-Kintsugi-GGUF

ReallyFloppyPenguin/sarvam-m-GGUF

24B • Updated Jun 14 • 24 • 1

ReallyFloppyPenguin/DeepSeek-R1-0528-Qwen3-8B-GGUF

8B • Updated Jul 5 • 66

ReallyFloppyPenguin/MiniCPM4-8B-GGUF

8B • Updated Jun 14 • 13

ReallyFloppyPenguin/Nemotron-Research-Reasoning-Qwen-1.5B-GGUF

2B • Updated Jun 14 • 28 • 1