Models: 10,138

Active filters: llama-cpp

ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q5_K_M-GGUF

Updated Oct 31 • 28 • 2

ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q4_K_M-GGUF

Updated Oct 31 • 4 • 2

ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q4_0-GGUF

Updated Oct 31 • 141 • 2

ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q5_0-GGUF

Updated Oct 31 • 2 • 1

ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q5_K_S-GGUF

Updated Oct 31 • 2 • 1

ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q4_K_S-GGUF

Updated Oct 31 • 2 • 2

aashish1904/Llama-3.1-Swallow-8B-v0.1-Q4_K_M-GGUF

Text Generation • Updated Oct 31 • 6 • 1

singhjagpreet/Llama-3.2-1B-Instruct-Q8_0-GGUF

Text Generation • Updated Oct 31 • 1

aashish1904/mistral-rrc-Q4_K_M-GGUF

Updated Oct 31 • 7 • 1

NikolayKozloff/Meraj-Mini-Q8_0-GGUF

Text2Text Generation • Updated Oct 31 • 16 • 1

andito/SmolLM2-1.7B-Instruct-F16-GGUF

Updated Oct 31 • 479 • 1

NikolayKozloff/SmolLM2-1.7B-Instruct-Q8_0-GGUF

Updated Oct 31 • 12 • 1

NikolayKozloff/SmolLM2-1.7B-Q8_0-GGUF

Updated Oct 31 • 10 • 1

NikolayKozloff/SmolLM2-360M-Instruct-Q8_0-GGUF

Updated Oct 31 • 6 • 1

NikolayKozloff/SmolLM2-135M-Instruct-Q8_0-GGUF

Updated Oct 31 • 9 • 1

HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF

Text Generation • Updated Nov 5 • 2.04k • 31

HuggingFaceTB/SmolLM2-360M-Instruct-GGUF

Updated Oct 31 • 1.37k • 17

Trappu/Stellar-Picaro-0.7-12B-Q5_K_M-GGUF

Updated Nov 1 • 5 • 1

bunnycore/LLama-3.2-1B-General-lora_model-F16-GGUF

Updated Nov 1 • 104 • 1

NikolayKozloff/AMD-OLMo-1B-Q8_0-GGUF

Updated Nov 1 • 19 • 1

NikolayKozloff/AMD-OLMo-1B-SFT-Q8_0-GGUF

Updated Nov 1 • 7 • 1

NikolayKozloff/AMD-OLMo-1B-SFT-DPO-Q8_0-GGUF

Updated Nov 1 • 11 • 1

NickMystic/SmolLM2-135M-Q8_0-GGUF

Updated Nov 2 • 9 • 1

bunnycore/Qwen2.5-7B-Exp2-lora_model-Q8_0-GGUF

Updated Nov 3 • 10 • 1

ryuzakizaki/Qwen2-Boundless-Q4_K_M-GGUF

Text2Text Generation • Updated Nov 3 • 58 • 1

marroyo777/Llama-3.2-1B-Instruct-IQ4_XS-GGUF

Text Generation • Updated Nov 3 • 9 • 1

NikolayKozloff/Phi-3-medium-4k-instruct-sq-LORA-F16-GGUF

Text Generation • Updated Nov 5 • 37 • 1

NikolayKozloff/Phi-3-medium-4k-instruct-sq-LORA-F32-GGUF

Text Generation • Updated Nov 5 • 31 • 1

NikolayKozloff/Phi-3-medium-4k-instruct-sq-LORA-Q8_0-GGUF

Text Generation • Updated Nov 5 • 8 • 1

NikolayKozloff/Phi-3-mini-4k-instruct-sq-LORA-F32-GGUF

Text Generation • Updated Nov 5 • 41 • 1