Committed by wonhosong
Commit eef8b90
1 parent: ee66058

Update README.md

Files changed (1)
  1. README.md +5 -4
README.md CHANGED
@@ -63,10 +63,11 @@ pipeline_tag: text-generation
 
 | Model | Average | ARC | HellaSwag | MMLU | TruthfulQA |
 |-----------------------------------------------|---------|-------|-----------|-------|------------|
-| llama-30b-instruct-2048 (Ours) | **64.7** | 58.3 | 82.5 | 61.4 | **56.5** |
-| falcon-40b-instruct | 63.4 | **61.6** | **84.3** | 55.4 | 52.5 |
-| llama-30b-instruct (Ours) | 63.2 | 56.7 | 84 | 59 | 53.1 |
-| llama-65b | 62.1 | 57.6 | **84.3** | **63.4** | 43 |
+| llama-65b-instruct (***Ours***, *Local Reproduction*) | **69.4** | **67.6** | **86.5** | **64.9** | **58.8** |
+| llama-30b-instruct-2048 (***Ours***) | 64.7 | 58.3 | 82.5 | 61.4 | 56.5 |
+| falcon-40b-instruct | 63.4 | 61.6 | 84.3 | 55.4 | 52.5 |
+| llama-30b-instruct (***Ours***) | 63.2 | 56.7 | 84.0 | 59.0 | 53.1 |
+| llama-65b | 62.1 | 57.6 | 84.3 | 63.4 | 43.0 |
 
 *Experimental results based on the Open LLM Leaderboard*
 
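The Average column in the updated table appears to be the plain mean of the four benchmark scores. The sketch below is a quick consistency check under that assumption; it is not part of the README or of the leaderboard's tooling. The helper names (`mean_score`, `check_rows`) and the 0.1 tolerance are illustrative choices, and the tolerance exists only because the table shows values rounded to one decimal place.

```python
# Scores copied from the updated table, in the order
# (ARC, HellaSwag, MMLU, TruthfulQA), followed by the reported Average.
ROWS = {
    "llama-65b-instruct": ((67.6, 86.5, 64.9, 58.8), 69.4),
    "llama-30b-instruct-2048": ((58.3, 82.5, 61.4, 56.5), 64.7),
    "falcon-40b-instruct": ((61.6, 84.3, 55.4, 52.5), 63.4),
    "llama-30b-instruct": ((56.7, 84.0, 59.0, 53.1), 63.2),
    "llama-65b": ((57.6, 84.3, 63.4, 43.0), 62.1),
}


def mean_score(scores):
    """Arithmetic mean of the per-benchmark scores (assumed averaging rule)."""
    return sum(scores) / len(scores)


def check_rows(rows, tolerance=0.1):
    """Compare each reported Average with the recomputed mean.

    Returns {model: (recomputed_mean, reported_average, within_tolerance)}.
    The tolerance is a hypothetical allowance for one-decimal rounding.
    """
    results = {}
    for model, (scores, reported) in rows.items():
        computed = mean_score(scores)
        results[model] = (computed, reported, abs(computed - reported) <= tolerance)
    return results


if __name__ == "__main__":
    for model, (computed, reported, ok) in check_rows(ROWS).items():
        status = "consistent" if ok else "MISMATCH"
        print(f"{model}: recomputed {computed:.3f}, reported {reported} ({status})")
```

Small gaps between the recomputed and reported values are expected, since the published per-benchmark scores are themselves rounded.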