Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
SambaNova
Fireworks
fal
Hyperbolic
Cerebras
Replicate
Novita
Together AI
Nebius AI Studio
HF Inference API
Misc
Reset Misc
compressed-tensors
Inference Endpoints
text-generation-inference
AutoTrain Compatible
8-bit precision
custom_code
Merge
Eval Results
Mixture of Experts
text-embeddings-inference
Misc with no match
4-bit precision
Carbon Emissions
Apply filters
Models
1,051
Full-text search
Edit filters
Sort: Trending
Active filters:
compressed-tensors
Clear all
stan-hua/Qwen2.5-32B-Instruct-LC-SmoothQuant-RTN-W8A16
Updated
Jan 17
•
5
stan-hua/Phi-3-small-7B-Instruct-LC-RTN-W4A16
Updated
Jan 20
•
4
stan-hua/Phi-3-small-7B-Instruct-LC-RTN-W8A8
Updated
Jan 20
•
6
stan-hua/Phi-3-small-7B-Instruct-LC-RTN-W8A16
Updated
Jan 20
•
4
zumalabs/DeepSeek-R1-Distill-Qwen-14B-FP8
Updated
Jan 20
•
23
•
3
zumalabs/DeepSeek-R1-Distill-Llama-70B-FP8
Updated
Jan 20
•
71.8k
•
5
zumalabs/DeepSeek-R1-Distill-Llama-8B-FP8
Updated
Jan 20
•
100
•
2
zumalabs/DeepSeek-R1-Distill-Qwen-7B-FP8
Updated
Jan 20
•
49
•
2
zumalabs/DeepSeek-R1-Distill-Qwen-1.5B-FP8
Updated
Jan 20
•
11
•
2
nm-testing/whisper-tiny-W4A16-G128
Updated
Jan 20
•
7
nm-testing/whisper-large-v2-W4A16-G128
Updated
Jan 21
•
7
alpindale/opt-125m-FP8-Dynamic
Updated
Jan 21
•
13
tolgaakar/Mistral-Small-Instruct-2409-FP8-Dynamic
Updated
Jan 21
•
6
tolgaakar/watt-tool-8B-FP8-Dynamic
Updated
Jan 21
•
105
shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-SQ-GPTQ-W8A8-INT8
Updated
Jan 21
•
7
shisa-ai/Mistral-Nemo-Japanese-Instruct-FP8-Dynamic
Updated
Jan 21
•
5
SicariusSicariiStuff/Eximius_Persona_5B_FP8
Updated
Jan 21
•
9
ExceedZhang/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128
Updated
Feb 2
•
69
nm-testing/DeepSeek-R1-Distill-Qwen-1.5B-W4A16-G128
Updated
Jan 22
•
4
nm-testing/granite-3.1-2b-instruct-W4A16-G128
Updated
Jan 22
•
4
nm-testing/llama2.c-stories42M-pruned2.4-compressed
Updated
Jan 22
•
16
mlinmg/Aurora-0125-W4A16-GPTQ
Text Generation
•
Updated
Jan 23
•
17
dwetzel/DeepSeek-R1-Distill-Qwen-32B-FP8-Dynamic
Text Generation
•
Updated
Jan 23
•
20
dwetzel/Qwen2.5-32B-Instruct-FP8-Dynamic
Text Generation
•
Updated
Jan 23
•
16
otherhalf-dev/Llama-3.3-70B-Instruct-abliterated-fp8
Updated
Jan 23
•
10
soprasteria/DeepSeek-R1-Distill-Llama-70B-FP8-KV
Text Generation
•
Updated
Jan 23
neuralmagic/Qwen2-VL-72B-Instruct-quantized.w4a16
Image-Text-to-Text
•
Updated
1 day ago
•
177
noneUsername/Wayfarer-12B-W8A8
Updated
Jan 24
•
5
SicariusSicariiStuff/Wingless_Imp_8B_FP8
Updated
Jan 24
•
6
just-add-ai/Llama-3.3-70B-Instruct-FP8-Dynamic
Text Generation
•
Updated
Feb 7
•
18
Previous
1
...
23
24
25
26
27
...
36
Next