Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
's Collections
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community
Granite 3.1 Quantization
updated
11 days ago
Upvote
-
neuralmagic/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
•
Updated
11 days ago
•
235
neuralmagic/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
Updated
11 days ago
•
200
neuralmagic/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
Updated
5 days ago
•
162
•
1
neuralmagic/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
Updated
11 days ago
•
176
neuralmagic/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
•
Updated
7 days ago
•
73
neuralmagic/granite-3.1-8b-instruct-FP8-dynamic
Text Generation
•
Updated
11 days ago
•
89
•
1
neuralmagic/granite-3.1-2b-base-quantized.w4a16
Text Generation
•
Updated
5 days ago
•
36
neuralmagic/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
Updated
5 days ago
•
26
neuralmagic/granite-3.1-8b-base-FP8-dynamic
Updated
5 days ago
neuralmagic/granite-3.1-2b-base-FP8-dynamic
Text Generation
•
Updated
5 days ago
•
40
neuralmagic/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
Updated
5 days ago
•
32
neuralmagic/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
Updated
5 days ago
•
11
Upvote
-
Share collection
View history
Collection guide
Browse collections