Update README.md
Browse files
README.md
CHANGED
@@ -62,11 +62,11 @@ pipeline_tag: text-generation
|
|
62 |
**Main Results**
|
63 |
| Model | Average | ARC | HellaSwag | MMLU | TruthfulQA |
|
64 |
|-----------------------------------------------|---------|-------|-----------|-------|------------|
|
65 |
-
| llama-65b-instruct (***Ours***,
|
66 |
-
| llama-30b-instruct-2048 (***Ours***)
|
67 |
-
| falcon-40b-instruct
|
68 |
-
| llama-30b-instruct (***Ours***)
|
69 |
-
| llama-65b
|
70 |
|
71 |
*Experimental results based on the Open LLM Leaderboard*
|
72 |
|
|
|
62 |
**Main Results**
|
63 |
| Model | Average | ARC | HellaSwag | MMLU | TruthfulQA |
|
64 |
|-----------------------------------------------|---------|-------|-----------|-------|------------|
|
65 |
+
| llama-65b-instruct (***Ours***, ***Local Reproduction***) | **69.4** | **67.6** | **86.5** | **64.9** | **58.8** |
|
66 |
+
| llama-30b-instruct-2048 (***Ours***) | 64.7 | 58.3 | 82.5 | 61.4 | 56.5 |
|
67 |
+
| falcon-40b-instruct | 63.4 | 61.6 | 84.3 | 55.4 | 52.5 |
|
68 |
+
| llama-30b-instruct (***Ours***) | 63.2 | 56.7 | 84.0 | 59.0 | 53.1 |
|
69 |
+
| llama-65b | 62.1 | 57.6 | 84.3 | 63.4 | 43.0 |
|
70 |
|
71 |
*Experimental results based on the Open LLM Leaderboard*
|
72 |
|