Models: 1,051

Active filters: compressed-tensors

stan-hua/Qwen2.5-32B-Instruct-LC-SmoothQuant-RTN-W8A16

Updated Jan 17 • 5

stan-hua/Phi-3-small-7B-Instruct-LC-RTN-W4A16

Updated Jan 20 • 4

stan-hua/Phi-3-small-7B-Instruct-LC-RTN-W8A8

Updated Jan 20 • 6

stan-hua/Phi-3-small-7B-Instruct-LC-RTN-W8A16

Updated Jan 20 • 4

zumalabs/DeepSeek-R1-Distill-Qwen-14B-FP8

Updated Jan 20 • 23 • 3

zumalabs/DeepSeek-R1-Distill-Llama-70B-FP8

Updated Jan 20 • 71.8k • 5

zumalabs/DeepSeek-R1-Distill-Llama-8B-FP8

Updated Jan 20 • 100 • 2

zumalabs/DeepSeek-R1-Distill-Qwen-7B-FP8

Updated Jan 20 • 49 • 2

zumalabs/DeepSeek-R1-Distill-Qwen-1.5B-FP8

Updated Jan 20 • 11 • 2

nm-testing/whisper-tiny-W4A16-G128

Updated Jan 20 • 7

nm-testing/whisper-large-v2-W4A16-G128

Updated Jan 21 • 7

alpindale/opt-125m-FP8-Dynamic

Updated Jan 21 • 13

tolgaakar/Mistral-Small-Instruct-2409-FP8-Dynamic

Updated Jan 21 • 6

tolgaakar/watt-tool-8B-FP8-Dynamic

Updated Jan 21 • 105

shisa-ai/Mistral-Nemo-Japanese-Instruct-2408-SQ-GPTQ-W8A8-INT8

Updated Jan 21 • 7

shisa-ai/Mistral-Nemo-Japanese-Instruct-FP8-Dynamic

Updated Jan 21 • 5

SicariusSicariiStuff/Eximius_Persona_5B_FP8

Updated Jan 21 • 9

ExceedZhang/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128

Updated Feb 2 • 69

nm-testing/DeepSeek-R1-Distill-Qwen-1.5B-W4A16-G128

Updated Jan 22 • 4

nm-testing/granite-3.1-2b-instruct-W4A16-G128

Updated Jan 22 • 4

nm-testing/llama2.c-stories42M-pruned2.4-compressed

Updated Jan 22 • 16

mlinmg/Aurora-0125-W4A16-GPTQ

Text Generation • Updated Jan 23 • 17

dwetzel/DeepSeek-R1-Distill-Qwen-32B-FP8-Dynamic

Text Generation • Updated Jan 23 • 20

dwetzel/Qwen2.5-32B-Instruct-FP8-Dynamic

Text Generation • Updated Jan 23 • 16

otherhalf-dev/Llama-3.3-70B-Instruct-abliterated-fp8

Updated Jan 23 • 10

soprasteria/DeepSeek-R1-Distill-Llama-70B-FP8-KV

Text Generation • Updated Jan 23

neuralmagic/Qwen2-VL-72B-Instruct-quantized.w4a16

Image-Text-to-Text • Updated 1 day ago • 177

noneUsername/Wayfarer-12B-W8A8

Updated Jan 24 • 5

SicariusSicariiStuff/Wingless_Imp_8B_FP8

Updated Jan 24 • 6

just-add-ai/Llama-3.3-70B-Instruct-FP8-Dynamic

Text Generation • Updated Feb 7 • 18