AIFunOver commited on
Commit
715cb1b
1 Parent(s): 4366772

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: THUDM/glm-4-9b-chat
3
+ language:
4
+ - zh
5
+ - en
6
+ license: other
7
+ license_name: glm-4
8
+ license_link: https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/LICENSE
9
+ tags:
10
+ - glm
11
+ - chatglm
12
+ - thudm
13
+ - openvino
14
+ - nncf
15
+ - fp16
16
+ inference: false
17
+ ---
18
+
19
+ This model is a quantized version of [`THUDM/glm-4-9b-chat`](https://huggingface.co/THUDM/glm-4-9b-chat) and is converted to the OpenVINO format. This model was obtained via the [nncf-quantization](https://huggingface.co/spaces/echarlaix/nncf-quantization) space with [optimum-intel](https://github.com/huggingface/optimum-intel).
20
+ First make sure you have `optimum-intel` installed:
21
+ ```bash
22
+ pip install optimum[openvino]
23
+ ```
24
+ To load your model you can do as follows:
25
+ ```python
26
+ from optimum.intel import OVModelForCausalLM
27
+ model_id = "AIFunOver/glm-4-9b-chat-openvino-fp16"
28
+ model = OVModelForCausalLM.from_pretrained(model_id)
29
+ ```