notstoic
/

pygmalion-13b-4bit-128g

Text Generation

text-generation-inference

Model card Files Files and versions Community

Pernekhan commited on Jan 31

Commit

5f7a136

•

1 Parent(s): eece8c4

Create quantize_config.json

This is to make it work with engines like vLLM

Files changed (1) hide show

quantize_config.json +6 -0

quantize_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "bits": 4,
+  "desc_act": false,
+  "group_size": 128,
+  "true_sequential": true
+}