Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
/
gemma-2-2b-it-quantized.w8a8
like
0
Follow
Neural Magic
277
Text Generation
Safetensors
gemma2
int8
vllm
conversational
8-bit precision
compressed-tensors
arxiv:
2210.17323
License:
gemma
Model card
Files
Files and versions
Community
alexmarques
commited on
Aug 16, 2024
Commit
0e84fac
·
verified
·
1 Parent(s):
2d65072
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-0
README.md
CHANGED
Viewed
@@ -4,6 +4,7 @@ tags:
4
- int8
5
- vllm
6
license: gemma
7
---
8
9
# gemma-2-2b-it-quantized.w8a8
4
- int8
5
- vllm
6
license: gemma
7
+
base_model: google/gemma-2-2b-it
8
---
9
10
# gemma-2-2b-it-quantized.w8a8