pooja-ganesh commited on
Commit
32fbbb0
·
verified ·
1 Parent(s): 470ab0c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +69 -3
README.md CHANGED
@@ -1,3 +1,69 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ base_model:
6
+ - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
7
+ pipeline_tag: text-generation
8
+ ---
9
+
10
+ # DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid
11
+ - ## Introduction
12
+ This model was created by applying [Quark](https://quark.docs.amd.com/latest/index.html) with calibration samples from Pile dataset.
13
+ - ## Quantization Strategy
14
+ - ***Quantized Layers***: All linear layers
15
+ - ***Weight***: uint4 asymmetric per-group, group_size=128
16
+ - ## Quick Start
17
+ 1. [Download and install Quark](https://quark.docs.amd.com/latest/install.html)
18
+ 2. Run the quantization script in the example folder using the following command line:
19
+ ```sh
20
+ export MODEL_DIR = [local model checkpoint folder] or DeepSeek-R1-Distill-Llama-8B
21
+ # single GPU
22
+ python quantize_quark.py --model_dir $MODEL_DIR \
23
+ --output_dir output_dir $MODEL_NAME-awq-asym-uint4-g128-lmhead \
24
+ --quant_scheme w_uint4_per_group_asym \
25
+ --num_calib_data 128 \
26
+ --quant_algo awq \
27
+ --dataset pileval_for_awq_benchmark \
28
+ --seq_len 512 \
29
+ --model_export hf_format \
30
+ --data_type bfloat16 \
31
+ --exclude_layers
32
+ # cpu
33
+ python quantize_quark.py --model_dir $MODEL_DIR \
34
+ --output_dir output_dir $MODEL_NAME-awq-asym-uint4-g128-lmhead \
35
+ --quant_scheme w_uint4_per_group_asym \
36
+ --num_calib_data 128 \
37
+ --quant_algo awq \
38
+ --dataset pileval_for_awq_benchmark \
39
+ --seq_len 512 \
40
+ --model_export hf_format \
41
+ --data_type bfloat16 \
42
+ --exclude_layers \
43
+ --device cpu
44
+ ```
45
+
46
+ #### License
47
+ Modifications copyright(c) 2024 Advanced Micro Devices,Inc. All rights reserved.
48
+
49
+ MIT License
50
+
51
+ Copyright (c) 2023 DeepSeek
52
+
53
+ Permission is hereby granted, free of charge, to any person obtaining a copy
54
+ of this software and associated documentation files (the "Software"), to deal
55
+ in the Software without restriction, including without limitation the rights
56
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
57
+ copies of the Software, and to permit persons to whom the Software is
58
+ furnished to do so, subject to the following conditions:
59
+
60
+ The above copyright notice and this permission notice shall be included in all
61
+ copies or substantial portions of the Software.
62
+
63
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
64
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
65
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
66
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
67
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
68
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
69
+ SOFTWARE.