Models: 719
Sort: Trending
Active filters: compressed-tensors
SicariusSicariiStuff/Wingless_Imp_8B_FP8 • Updated 3 days ago • 2
nm-testing/kyle-Qwen2-VL-72B-Instruct-W4A16-G128 • Updated 2 days ago • 2
saiscorelabsai/Llama-3.2-1B-Instruct-FP8-Dynamic • Text Generation • Updated 1 day ago • 3
saiscorelabsai/Llama-3.2-1B-Instruct-FP8-KV • Text Generation • Updated 1 day ago • 6
saiscorelabsai/Llama-3.2-1B-Instruct-W4A16-G128 • Text Generation • Updated 1 day ago • 3
saiscorelabsai/Llama-3.2-3B-Instruct-W4A16-G128 • Text Generation • Updated 1 day ago • 2
saiscorelabsai/Llama-3.2-3B-Instruct-FP8-KV • Text Generation • Updated 1 day ago • 2
saiscorelabsai/Llama-3.2-3B-Instruct-FP8-Dynamic • Text Generation • Updated 1 day ago • 3
leon-se/SmolVLM-Instruct-W4A16-G128 • Image-Text-to-Text • Updated about 10 hours ago • 2
nm-testing/DeepSeek-R1-Distill-Qwen-14B-FP8-Dynamic • Updated about 8 hours ago
nm-testing/granite-3.1-8b-instruct-FP8-Dynamic • Updated about 7 hours ago
nm-testing/DeepSeek-R1-Distill-Qwen-14B-W8A8-Dynamic-Per-Token • Updated about 7 hours ago
nm-testing/granite-8b-code-instruct-128k-FP8-Dynamic • Updated about 7 hours ago
nm-testing/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128 • Updated about 7 hours ago
nm-testing/Mistral-7B-Instruct-v0.3-FP8-Dynamic • Updated about 7 hours ago
nm-testing/DeepSeek-R1-Distill-Qwen-14B2of4-sparse • Updated about 6 hours ago
nm-testing/DeepSeek-R1-Distill-Qwen-14B2of4-W8A8-FP8-Dynamic-Per-Token • Updated about 6 hours ago
nm-testing/granite-3.1-8b-instruct-W8A8-Dynamic-Per-Token • Updated about 6 hours ago
nm-testing/granite-3.1-8b-instruct-W4A16-G128 • Updated about 6 hours ago
nm-testing/granite-8b-code-instruct-128k-W8A8-Dynamic-Per-Token • Updated about 6 hours ago
nm-testing/granite-8b-code-instruct-128k-W4A16-G128 • Updated about 6 hours ago
nm-testing/Mistral-7B-Instruct-v0.3-W8A8-Dynamic-Per-Token • Updated about 6 hours ago
nm-testing/Mistral-7B-Instruct-v0.3-W4A16-G128 • Updated about 6 hours ago
nm-testing/granite-3.1-8b-instruct2of4-sparse • Updated about 4 hours ago
nm-testing/granite-3.1-8b-instruct2of4-W8A8-FP8-Dynamic-Per-Token • Updated about 4 hours ago
nm-testing/granite-8b-code-instruct-128k2of4-sparse • Updated about 3 hours ago
nm-testing/granite-8b-code-instruct-128k2of4-W8A8-FP8-Dynamic-Per-Token • Updated about 3 hours ago
nm-testing/Mistral-7B-Instruct-v0.32of4-sparse • Updated about 3 hours ago
nm-testing/Mistral-7B-Instruct-v0.32of4-W8A8-FP8-Dynamic-Per-Token • Updated about 3 hours ago