jyhong836 commited on
Commit
29b7dc1
·
1 Parent(s): 83b7e94

Update model list

Browse files
Files changed (1) hide show
  1. README.md +41 -34
README.md CHANGED
@@ -92,40 +92,47 @@ outputs = model.generate(input_ids=input_ids, max_length=128)
92
  tokenizer.decode(outputs[0])
93
  ```
94
 
95
- | | Base Model | Model Size | Compression Method | Compression Degree |
96
- |---:|:-------------|:-------------|:----------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
97
- | 0 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.1](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.1) |
98
- | 1 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.2](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.2) |
99
- | 2 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.3](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.3) |
100
- | 3 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.5](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.5) |
101
- | 4 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.6](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.6) |
102
- | 5 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.1](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.1) |
103
- | 6 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.2](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.2) |
104
- | 7 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.3](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.3) |
105
- | 8 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.5](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.5) |
106
- | 9 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.6](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.6) |
107
- | 10 | Llama-2 | 7b | [wanda_gptq](https://huggingface.co/vita-group/llama-2-7b_wanda_2_4_gptq_4bit_128g) | 4bit_128g |
108
- | 11 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.1](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.1) |
109
- | 12 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.2](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.2) |
110
- | 13 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.3](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.3) |
111
- | 14 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.5](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.5) |
112
- | 15 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.6](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.6) |
113
- | 16 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [10bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/10bit_128g) |
114
- | 17 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [12bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/12bit_128g) |
115
- | 18 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [14bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/14bit_128g) |
116
- | 19 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [2bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/2bit_128g) |
117
- | 20 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [3bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/3bit_128g) |
118
- | 21 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [4bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/4bit_128g) |
119
- | 22 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [6bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/6bit_128g) |
120
- | 23 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [8bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/8bit_128g) |
121
- | 24 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [10bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/10bit_128g) |
122
- | 25 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [12bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/12bit_128g) |
123
- | 26 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [14bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/14bit_128g) |
124
- | 27 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [2bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/2bit_128g) |
125
- | 28 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [3bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/3bit_128g) |
126
- | 29 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [4bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/4bit_128g) |
127
- | 30 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [6bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/6bit_128g) |
128
- | 31 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [8bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/8bit_128g) |
 
 
 
 
 
 
 
129
 
130
 
131
  For any question, please contact [Junyuan Hong](mailto:[email protected]).
 
92
  tokenizer.decode(outputs[0])
93
  ```
94
 
95
+
96
+ | | Base Model | Model Size | Compression Method | Compression Degree |
97
+ |---:|:-------------|:-------------|:------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------|
98
+ | 0 | Llama-2 | 13b | [magnitude_semistruct](https://huggingface.co/vita-group/llama-2-13b_magnitude_semistruct) | [0.5_2to4](https://huggingface.co/vita-group/llama-2-13b_magnitude_semistruct/tree/0.5_2to4) |
99
+ | 1 | Llama-2 | 13b | [sparsegpt_semistruct](https://huggingface.co/vita-group/llama-2-13b_sparsegpt_semistruct) | [0.5_2to4](https://huggingface.co/vita-group/llama-2-13b_sparsegpt_semistruct/tree/0.5_2to4) |
100
+ | 2 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.1](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.1) |
101
+ | 3 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.2](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.2) |
102
+ | 4 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.3](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.3) |
103
+ | 5 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.5](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.5) |
104
+ | 6 | Llama-2 | 7b | [magnitude_unstructured](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured) | [s0.6](https://huggingface.co/vita-group/llama-2-7b_magnitude_unstructured/tree/s0.6) |
105
+ | 7 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.1](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.1) |
106
+ | 8 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.2](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.2) |
107
+ | 9 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.3](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.3) |
108
+ | 10 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.5](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.5) |
109
+ | 11 | Llama-2 | 7b | [sparsegpt_unstructured](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured) | [s0.6](https://huggingface.co/vita-group/llama-2-7b_sparsegpt_unstructured/tree/s0.6) |
110
+ | 12 | Llama-2 | 7b | [wanda_gptq](https://huggingface.co/vita-group/llama-2-7b_wanda_2_4_gptq_4bit_128g) | 4bit_128g |
111
+ | 13 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.1](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.1) |
112
+ | 14 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.2](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.2) |
113
+ | 15 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.3](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.3) |
114
+ | 16 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.5](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.5) |
115
+ | 17 | Llama-2 | 7b | [wanda_unstructured](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured) | [s0.6](https://huggingface.co/vita-group/llama-2-7b_wanda_unstructured/tree/s0.6) |
116
+ | 18 | Llama-2-chat | 13b | [magnitude_semistruct](https://huggingface.co/vita-group/llama-2-13b-chat_magnitude_semistruct) | [0.5_2to4](https://huggingface.co/vita-group/llama-2-13b-chat_magnitude_semistruct/tree/0.5_2to4) |
117
+ | 19 | Llama-2-chat | 13b | [sparsegpt_semistruct](https://huggingface.co/vita-group/llama-2-13b-chat_sparsegpt_semistruct) | [0.5_2to4](https://huggingface.co/vita-group/llama-2-13b-chat_sparsegpt_semistruct/tree/0.5_2to4) |
118
+ | 20 | vicuna | 13b | [magnitude_semistruct](https://huggingface.co/vita-group/vicuna-13b-v1.3_magnitude_semistruct) | [0.5_2to4](https://huggingface.co/vita-group/vicuna-13b-v1.3_magnitude_semistruct/tree/0.5_2to4) |
119
+ | 21 | vicuna | 13b | [sparsegpt_semistruct](https://huggingface.co/vita-group/vicuna-13b-v1.3_sparsegpt_semistruct) | [0.5_2to4](https://huggingface.co/vita-group/vicuna-13b-v1.3_sparsegpt_semistruct/tree/0.5_2to4) |
120
+ | 22 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [10bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/10bit_128g) |
121
+ | 23 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [12bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/12bit_128g) |
122
+ | 24 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [14bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/14bit_128g) |
123
+ | 25 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [2bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/2bit_128g) |
124
+ | 26 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [3bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/3bit_128g) |
125
+ | 27 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [4bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/4bit_128g) |
126
+ | 28 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [6bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/6bit_128g) |
127
+ | 29 | vicuna-v1.3 | 13b | [gptq](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq) | [8bit_128g](https://huggingface.co/vita-group/vicuna-13b-v1.3_gptq/tree/8bit_128g) |
128
+ | 30 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [10bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/10bit_128g) |
129
+ | 31 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [12bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/12bit_128g) |
130
+ | 32 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [14bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/14bit_128g) |
131
+ | 33 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [2bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/2bit_128g) |
132
+ | 34 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [3bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/3bit_128g) |
133
+ | 35 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [4bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/4bit_128g) |
134
+ | 36 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [6bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/6bit_128g) |
135
+ | 37 | vicuna-v1.3 | 7b | [gptq](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq) | [8bit_128g](https://huggingface.co/vita-group/vicuna-7b-v1.3_gptq/tree/8bit_128g) |
136
 
137
 
138
  For any question, please contact [Junyuan Hong](mailto:[email protected]).