Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
llama-cpp
Inference Endpoints
Merge
Eval Results
text-generation-inference
AutoTrain Compatible
Mixture of Experts
Carbon Emissions
text-embeddings-inference
custom_code
8-bit precision
4-bit precision
Apply filters
Models
10,138
Full-text search
Edit filters
Sort: Trending
Active filters:
llama-cpp
Clear all
ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q5_K_M-GGUF
Updated
Oct 31
•
28
•
2
ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q4_K_M-GGUF
Updated
Oct 31
•
4
•
2
ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q4_0-GGUF
Updated
Oct 31
•
141
•
2
ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q5_0-GGUF
Updated
Oct 31
•
2
•
1
ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q5_K_S-GGUF
Updated
Oct 31
•
2
•
1
ZeroXClem/Llama-3-Aetheric-Hermes-Lexi-Smaug-8B-Q4_K_S-GGUF
Updated
Oct 31
•
2
•
2
aashish1904/Llama-3.1-Swallow-8B-v0.1-Q4_K_M-GGUF
Text Generation
•
Updated
Oct 31
•
6
•
1
singhjagpreet/Llama-3.2-1B-Instruct-Q8_0-GGUF
Text Generation
•
Updated
Oct 31
•
1
aashish1904/mistral-rrc-Q4_K_M-GGUF
Updated
Oct 31
•
7
•
1
NikolayKozloff/Meraj-Mini-Q8_0-GGUF
Text2Text Generation
•
Updated
Oct 31
•
16
•
1
andito/SmolLM2-1.7B-Instruct-F16-GGUF
Updated
Oct 31
•
479
•
1
NikolayKozloff/SmolLM2-1.7B-Instruct-Q8_0-GGUF
Updated
Oct 31
•
12
•
1
NikolayKozloff/SmolLM2-1.7B-Q8_0-GGUF
Updated
Oct 31
•
10
•
1
NikolayKozloff/SmolLM2-360M-Instruct-Q8_0-GGUF
Updated
Oct 31
•
6
•
1
NikolayKozloff/SmolLM2-135M-Instruct-Q8_0-GGUF
Updated
Oct 31
•
9
•
1
HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF
Text Generation
•
Updated
Nov 5
•
2.04k
•
31
HuggingFaceTB/SmolLM2-360M-Instruct-GGUF
Updated
Oct 31
•
1.37k
•
17
Trappu/Stellar-Picaro-0.7-12B-Q5_K_M-GGUF
Updated
Nov 1
•
5
•
1
bunnycore/LLama-3.2-1B-General-lora_model-F16-GGUF
Updated
Nov 1
•
104
•
1
NikolayKozloff/AMD-OLMo-1B-Q8_0-GGUF
Updated
Nov 1
•
19
•
1
NikolayKozloff/AMD-OLMo-1B-SFT-Q8_0-GGUF
Updated
Nov 1
•
7
•
1
NikolayKozloff/AMD-OLMo-1B-SFT-DPO-Q8_0-GGUF
Updated
Nov 1
•
11
•
1
NickMystic/SmolLM2-135M-Q8_0-GGUF
Updated
Nov 2
•
9
•
1
bunnycore/Qwen2.5-7B-Exp2-lora_model-Q8_0-GGUF
Updated
Nov 3
•
10
•
1
ryuzakizaki/Qwen2-Boundless-Q4_K_M-GGUF
Text2Text Generation
•
Updated
Nov 3
•
58
•
1
marroyo777/Llama-3.2-1B-Instruct-IQ4_XS-GGUF
Text Generation
•
Updated
Nov 3
•
9
•
1
NikolayKozloff/Phi-3-medium-4k-instruct-sq-LORA-F16-GGUF
Text Generation
•
Updated
Nov 5
•
37
•
1
NikolayKozloff/Phi-3-medium-4k-instruct-sq-LORA-F32-GGUF
Text Generation
•
Updated
Nov 5
•
31
•
1
NikolayKozloff/Phi-3-medium-4k-instruct-sq-LORA-Q8_0-GGUF
Text Generation
•
Updated
Nov 5
•
8
•
1
NikolayKozloff/Phi-3-mini-4k-instruct-sq-LORA-F32-GGUF
Text Generation
•
Updated
Nov 5
•
41
•
1
Previous
1
...
5
6
7
8
9
...
100
Next