Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
fp8
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
8-bit precision
Merge
Eval Results
Mixture of Experts
Misc with no match
4-bit precision
text-embeddings-inference
Carbon Emissions
Apply filters
Models
317
Full-text search
Edit filters
Sort: Trending
Active filters:
fp8
Clear all
raja-nectar/Lumimaid-70B-FP8-OAS
Text Generation
•
Updated
Jul 12
•
6
Ksgk-fy/maria-v2-fp8-dynamic
Text Generation
•
Updated
Jul 12
Ksgk-fy/maria-v2-fp8-static
Text Generation
•
Updated
Jul 12
•
7
Ksgk-fy/maria_v113-fp8-dynamic
Text Generation
•
Updated
Jul 13
Ksgk-fy/maria_v114-fp8-dynamic
Text Generation
•
Updated
Jul 13
•
7
Ksgk-fy/maria_v115-fp8-dynamic
Text Generation
•
Updated
Jul 14
•
4
nm-testing/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
•
Updated
Jul 14
•
10
neuralmagic/Qwen2-57B-A14B-Instruct-FP8
Text Generation
•
Updated
Jul 18
•
721
•
1
nm-testing/Qwen2-1.5B-Instruct-FP8-K-V
Text Generation
•
Updated
Jul 16
•
1.42k
nm-testing/Meta-Llama-3-8B-Instruct-FP8-K-V
Text Generation
•
Updated
Oct 9
•
11
neuralmagic/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
•
Updated
Jul 18
•
8.8k
•
6
neuralmagic/DeepSeek-Coder-V2-Lite-Base-FP8
Text Generation
•
Updated
Jul 18
•
78
Rallio67/llama3-70b-exab-fp8
Text Generation
•
Updated
Jul 18
•
6
mgoin/Mistral-Nemo-Instruct-2407-FP8-Dynamic
Text Generation
•
Updated
Jul 18
•
441
mgoin/Mistral-Nemo-Instruct-2407-FP8-KV
Text Generation
•
Updated
Jul 18
•
13
obamaTeo/llama-finetune-8bit-wiki-252-ver2
Text Generation
•
Updated
Jul 18
•
10
FlorianJc/Mistral-Nemo-Instruct-2407-vllm-fp8
Text Generation
•
Updated
Jul 31
•
24.1k
•
8
darthhexx/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
Updated
Jul 22
•
5
mgoin/Nemotron-4-340B-Instruct-FP8-Dynamic
Text Generation
•
Updated
Jul 23
•
8
neuralmagic/DeepSeek-Coder-V2-Base-FP8
Text Generation
•
Updated
Jul 22
•
35
mgoin/Minitron-4B-Base-FP8
Text Generation
•
Updated
Aug 16
•
878
•
3
mgoin/Minitron-8B-Base-FP8
Text Generation
•
Updated
Jul 26
•
24
•
3
nm-testing/Qwen2-0.5B-Instruct-FP8-SkipQKV
Text Generation
•
Updated
Jul 23
•
1.96k
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8-dynamic
Text Generation
•
Updated
Oct 19
•
1.64k
•
5
PrimeIntellect/Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
Jul 23
•
27
darthhexx/Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
Jul 24
•
21
mgoin/Nemotron-4-340B-Base-hf-FP8
Text Generation
•
Updated
Aug 8
•
30
•
2
FlorianJc/ghost-8b-beta-vllm-fp8
Text Generation
•
Updated
Jul 25
•
22
FlorianJc/Meta-Llama-3.1-8B-Instruct-vllm-fp8
Text Generation
•
Updated
Jul 25
•
403
iwaitu/llama-3.1-70b-chinese-chat-FP8
Text Generation
•
Updated
Aug 28
•
19
Previous
1
2
3
4
5
6
...
11
Next