Update README.md
README.md (CHANGED)
````diff
@@ -24,13 +24,10 @@ base_model: THUDM/glm-4v-9b
 ## Usage
 This model is quantized using [AutoGPTQ](https://github.com/AutoGPTQ/AutoGPTQ) for [THUDM/glm-4v-9b](https://huggingface.co/THUDM/glm-4v-9b).
 
-
+It is recommended to install AutoGPTQ by compiling from the source code.
 
 (The quantization script will be released later)
 
-```bash
-pip install auto-gptq
-```
 
 Since the original auto-gptq library does not support the quantization of chatglm models, manual import (hack) is required.
 ```python
````
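For reference, building AutoGPTQ from source (as the added line recommends) usually looks roughly like the sketch below. This is a minimal, hedged example rather than the project's official procedure: the repository URL is the one linked above, but the exact build steps and flags (for instance, whether the CUDA extension gets compiled) depend on your environment, so defer to the AutoGPTQ README.

```bash
# Minimal sketch: install AutoGPTQ from source instead of `pip install auto-gptq`.
# Exact flags and environment variables may differ; check the AutoGPTQ repository
# for the authoritative build instructions.
git clone https://github.com/AutoGPTQ/AutoGPTQ.git
cd AutoGPTQ
# An editable install runs the package's own build step locally; the CUDA kernels
# are typically compiled here if a matching CUDA toolkit is available (assumption).
pip install -e .
```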