Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community
Granite 3.1 Quantization
updated
Jan 24
Upvote
-
neuralmagic/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
•
Updated
9 days ago
•
213
neuralmagic/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
Updated
9 days ago
•
58
neuralmagic/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
Updated
9 days ago
•
209
•
1
neuralmagic/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
Updated
9 days ago
•
84
•
1
neuralmagic/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
•
Updated
Jan 28
•
66
neuralmagic/granite-3.1-8b-instruct-FP8-dynamic
Text Generation
•
Updated
Jan 25
•
59
•
1
neuralmagic/granite-3.1-2b-base-quantized.w4a16
Text Generation
•
Updated
9 days ago
•
55
neuralmagic/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
Updated
9 days ago
•
62
neuralmagic/granite-3.1-8b-base-FP8-dynamic
Text Generation
•
Updated
17 days ago
•
23
neuralmagic/granite-3.1-2b-base-FP8-dynamic
Text Generation
•
Updated
Jan 30
•
34
neuralmagic/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
Updated
9 days ago
•
51
neuralmagic/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
Updated
9 days ago
•
46
Upvote
-
Share collection
View history
Collection guide
Browse collections