Thireus commited on
Commit
a69c174
1 Parent(s): 01933fb

Added PPL results with new-eval

Browse files
Files changed (1) hide show
  1. README.md +22 -4
README.md CHANGED
@@ -40,12 +40,30 @@ Refs:
40
 
41
  # Benchmarks
42
 
43
- Best results in **bold**.
44
 
45
- | Model | wikitext2 PPL | c4 PPL | ptb PPL | C4-VRAM |
 
 
46
  |---|---|---|---|---|
47
- | 4bit-GPTQ - TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g | 8.517391204833984 | **7.058407783508301** | 20.888103485107422 | **8670.26953125** |
48
- | 8bit-GPTQ - Thireus/Vicuna13B-v1.1-8bit-128g | **8.508771896362305** | 7.105874538421631 | **20.75649070739746** | 14840.26171875 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
49
 
50
  <br>
51
  <br>
 
40
 
41
  # Benchmarks
42
 
43
+ Using https://github.com/qwopqwop200/GPTQ-for-LLaMa/. Best results in **bold**.
44
 
45
+ *`--benchmark 2048 --check` results:*
46
+
47
+ | Model | wikitext2 PPL | ptb PPL | c4 PPL | VRAM Utilization |
48
  |---|---|---|---|---|
49
+ | 4bit-GPTQ - TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g | 8.517391204833984 | 20.888103485107422 | **7.058407783508301** | **8670.26953125** |
50
+ | 8bit-GPTQ - Thireus/Vicuna13B-v1.1-8bit-128g | **8.508771896362305** | **20.75649070739746** | 7.105874538421631 | 14840.26171875 |
51
+
52
+ *`--eval` results (pending):*
53
+
54
+ | Model | wikitext2 PPL | ptb PPL | c4 PPL |
55
+ |---|---|---|---|
56
+ | 4bit-GPTQ - TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g | | | |
57
+ | 8bit-GPTQ - Thireus/Vicuna13B-v1.1-8bit-128g | | | |
58
+
59
+ *`--new-eval --eval` results:*
60
+
61
+ | Model | wikitext2 PPL | ptb-new PPL | c4-new PPL |
62
+ |---|---|---|---|
63
+ | 4bit-GPTQ - TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g | 7.119165420532227 | 35.637290954589844 | 9.550592422485352 |
64
+ | 8bit-GPTQ - Thireus/Vicuna13B-v1.1-8bit-128g | **6.988043308258057** | **34.264320373535156** | **9.426002502441406** |
65
+
66
+ PPL = Perplexity - https://huggingface.co/docs/transformers/perplexity
67
 
68
  <br>
69
  <br>