-
-
-
-
-
-
Inference Providers
Active filters:
int8
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
17
•
1
Weblet/Llama-2-7b-chat-hf-ct2-int8
Text Generation
•
Updated
•
96
ecastera/eva-dolphin-llama3-8b-spanish
Text Generation
•
Updated
•
98
•
4
Anthonyg5005/L3-8B-Stheno-v3.1-int8-ct2
Text Generation
•
Updated
•
10
Anthonyg5005/turbcat-instruct-8b-int8-ct2
Text Generation
•
Updated
•
13
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
•
Updated
•
35
•
2
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
3.87k
•
9
angeloc1/llama3dot1SimilarProcesses8
Text Generation
•
Updated
•
8
angeloc1/llama3dot1DifferentProcesses8
Text Generation
•
Updated
•
6
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
4.03k
•
19
FriendliAI/Meta-Llama-3-8B-int8
Text Generation
•
Updated
•
24
•
1
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a16
Text Generation
•
Updated
•
180
•
1
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8
Text Generation
•
Updated
•
1.27k
•
2
neuralmagic/gemma-2-9b-it-quantized.w8a16
Text Generation
•
Updated
•
560
•
1
neuralmagic/gemma-2-2b-it-quantized.w8a16
Text Generation
•
Updated
•
26
•
1
neuralmagic/gemma-2-2b-quantized.w8a16
Text Generation
•
Updated
•
42
neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
33
neuralmagic/gemma-2-2b-it-quantized.w8a8
Text Generation
•
Updated
•
123
angeloc1/llama3dot1FoodDel8v02
Text Generation
•
Updated
•
9
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
130
•
2
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
172
•
2
neuralmagic/SmolLM-360M-Instruct-quantized.w8a8
Text Generation
•
Updated
•
22
neuralmagic/SmolLM-135M-Instruct-quantized.w8a8
Text Generation
•
Updated
•
108
neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
9
•
1
zzzmahesh/Meta-Llama-3-8B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
15
FriendliAI/Meta-Llama-3.1-8B-Instruct-int8
Text Generation
•
Updated
•
42
•
1
FriendliAI/Meta-Llama-3.1-70B-Instruct-int8
Text Generation
•
Updated
•
12
neuralmagic/Qwen2.5-0.5B-quantized.w8a16
Text Generation
•
Updated
•
40
neuralmagic/Qwen2.5-1.5B-quantized.w8a16
Text Generation
•
Updated
•
32
neuralmagic/Qwen2.5-3B-quantized.w8a16
Text Generation
•
Updated
•
28