Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
TGI
vLLM
Apps with no match
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
node-llama-cpp
Ollama
MLX LM
Inference Providers
Inference Providers with no match
fal
Cerebras
Together AI
SambaNova
Novita
Hyperbolic
Cohere
Featherless AI
Groq
Replicate
Nebius AI
Fireworks
Nscale
HF Inference API
Misc
Reset Misc
torchao-my-repo
Inference Endpoints
text-generation-inference
4-bit precision
Merge
Misc with no match
Eval Results
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
35
Full-text search
Edit filters
Sort: Trending
Active filters:
torchao-my-repo
Clear all
medmekk/Llama-3.2-1B-ao-int8wo
Text Generation
•
Updated
Mar 31
•
13
medmekk/Llama-3.2-1B-ao-int8da8w
Text Generation
•
Updated
Mar 31
•
12
medmekk/Llama-3.2-1B-ao-int8wo-gs16
Text Generation
•
Updated
Mar 31
•
32
medmekk/Llama-3.2-1B-ao-int8wo-gs32
Text Generation
•
Updated
Mar 31
•
11
medmekk/Qwen2.5-0.5B-Instruct-ao-int8wo-gs128
Text Generation
•
Updated
Mar 31
•
13
medmekk/Qwen2.5-0.5B-Instruct-ao-int8da8w
Text Generation
•
Updated
Mar 31
•
13
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int4wo-gs128
Text Generation
•
Updated
Apr 5
•
59
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int8wo-gs128
Text Generation
•
Updated
Apr 5
•
51
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int8da8w
Text Generation
•
Updated
Apr 16
•
184
•
1
medmekk/Llama-3.2-1B-ao-float8wo
Text Generation
•
Updated
Apr 22
•
17
medmekk/Llama-3.2-1B-ao-float8da8w
Text Generation
•
Updated
Apr 22
•
15
medmekk/Llama-3.2-1B-ao-autoquant-1
Text Generation
•
Updated
Apr 22
•
18
medmekk/Llama-3.2-1B-ao-float8wo-2
Text Generation
•
Updated
Apr 22
•
16
medmekk/Llama-3.2-1B-ao-float8wo-3
Text Generation
•
Updated
Apr 22
•
12
medmekk/Llama-3.2-1B-ao-int8wo-gs256
Text Generation
•
Updated
Apr 22
•
37
medmekk/Llama-3.2-1B-ao-int4wo-gs128
Text Generation
•
Updated
Apr 22
•
10
medmekk/Qwen2.5-0.5B-Instruct-ao-float8wo
Text Generation
•
Updated
Apr 22
•
14
medmekk/Llama-3.2-1B-ao-int4wo-gs256
Text Generation
•
Updated
Apr 22
•
11
medmekk/Llama-3.1-8B-Instruct-ao-int8wo
Text Generation
•
Updated
Apr 24
•
33
medmekk/Llama-3.1-8B-Instruct-ao-autoquant
Text Generation
•
Updated
Apr 24
•
14
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs128
Text Generation
•
Updated
Apr 24
•
9
medmekk/Llama-3.1-8B-Instruct-ao-float8wo
Text Generation
•
Updated
Apr 24
•
8
medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8
Text Generation
•
Updated
Apr 24
•
9
medmekk/Llama-3.1-8B-Instruct-ao-int8da8w8
Text Generation
•
Updated
Apr 24
•
8
medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8-2
Text Generation
•
Updated
Apr 24
•
23
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs32
Text Generation
•
Updated
Apr 24
•
16
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs16
Text Generation
•
Updated
Apr 24
•
16
irresistiblegrace97/tinyllama.gguf
Updated
Apr 24
•
11
andysalerno/Qwen3-8B-ao-autoquant
Text Generation
•
Updated
May 9
•
6
HexLang/GPT2
Updated
May 14
•
33
Previous
1
2
Next