Hugging Face
Models: 983
Sort: Trending
Active filters: compressed-tensors
nm-testing/granite-8b-code-instruct-128k-W8A8-Dynamic-Per-Token • Updated Jan 26 • 7
nm-testing/granite-8b-code-instruct-128k-W4A16-G128 • Updated Jan 26 • 10
nm-testing/Mistral-7B-Instruct-v0.3-W8A8-Dynamic-Per-Token • Updated Jan 26 • 8
nm-testing/Mistral-7B-Instruct-v0.3-W4A16-G128 • Updated Jan 26 • 12
nm-testing/granite-3.1-8b-instruct2of4-sparse • Updated Jan 26 • 13
nm-testing/granite-3.1-8b-instruct2of4-W8A8-FP8-Dynamic-Per-Token • Updated Jan 26 • 14
nm-testing/granite-8b-code-instruct-128k2of4-sparse • Updated Jan 26 • 11
nm-testing/granite-8b-code-instruct-128k2of4-W8A8-FP8-Dynamic-Per-Token • Updated Jan 26 • 10
nm-testing/Mistral-7B-Instruct-v0.32of4-sparse • Updated Jan 26 • 8
nm-testing/Mistral-7B-Instruct-v0.32of4-W8A8-FP8-Dynamic-Per-Token • Updated Jan 26 • 15
nexa-collaboration/output_llama3.1_8b_2of4_stage_sparsity_0.9 • Updated Jan 27 • 11
nexa-collaboration/output_llama3.1_8b_2of4_stage_finetuning_0.9 • Updated Jan 27 • 8
nexa-collaboration/output_llama3.1_8b_2of4_stage_quantization_0.9 • Updated Jan 27 • 9
SicariusSicariiStuff/Impish_QWEN_14B-1M_FP8 • Updated Jan 27 • 18
SicariusSicariiStuff/Impish_QWEN_7B-1M_FP8 • Updated Jan 27 • 7
leon-se/Qwen2.5-VL-7B-Instruct-FP8-Dynamic • Updated 16 days ago • 8.68k
JamAndTeaStudios/gemma-2-9b-it-FP8-Dynamic • Text Generation • Updated Jan 29 • 26
nejumi/Qwen2.5-14B-Instruct-1M-W8A8-calib-ja-1k • Updated Jan 28 • 12
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-sparse24-layer-0-fp8-compressed • Updated Jan 28 • 10
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-sparse24-layer-0-5-fp8-compressed • Updated Jan 28 • 8
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-sparse24-0-5-remaining-fp8-compressed • Updated Jan 28 • 10
JamAndTeaStudios/DeepSeek-R1-Distill-Qwen-1.5B-FP8-Dynamic • Text Generation • Updated Jan 29 • 40
dwetzel/watt-tool-70B-GPTQ-INT4 • Updated Feb 4 • 75
JamAndTeaStudios/DeepSeek-R1-Distill-Llama-70B-FP8-Dynamic • Text Generation • Updated Jan 29 • 274
nm-testing/whisper-large-v2-FP8-dynamic • Updated Jan 28 • 14
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-full-sparse24 • Updated Jan 29 • 10
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24 • Updated Jan 29 • 11
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24-entire-fp8-compressed • Updated Jan 29 • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24-remaining-fp8-compressed • Updated Jan 29 • 14
JamAndTeaStudios/Qwen2.5-7B-Instruct-1M-FP8-Dynamic • Text Generation • Updated Jan 29 • 155