upstage/llama-30b-instruct


wonhosong committed on Jul 19, 2023

Commit eef8b90 (1 parent: ee66058)

Update README.md

Files changed (1)

README.md +5 -4


@@ -63,10 +63,11 @@ pipeline_tag: text-generation
 | Model                                         | Average | ARC   | HellaSwag | MMLU  | TruthfulQA |
 |-----------------------------------------------|---------|-------|-----------|-------|------------|
-| llama-30b-instruct-2048 (Ours)                | **64.7** | 58.3  | 82.5      | 61.4  | **56.5**    |
-| falcon-40b-instruct                           | 63.4    | **61.6** | **84.3**      | 55.4  | 52.5        |
-| llama-30b-instruct (Ours)                     | 63.2    | 56.7  | 84        | 59    | 53.1        |
-| llama-65b                                     | 62.1    | 57.6  | **84.3**      | **63.4**  | 43          |
 *Experimental results based on the Open LLM Leaderboard*

 | Model                                         | Average | ARC   | HellaSwag | MMLU  | TruthfulQA |
 |-----------------------------------------------|---------|-------|-----------|-------|------------|
+| llama-65b-instruct (***Ours***, *Local Reproduction*)                     | **69.4** | **67.6** | **86.5** | **64.9** | **58.8** |
+| llama-30b-instruct-2048 (***Ours***)                | 64.7 | 58.3 | 82.5 | 61.4 | 56.5 |
+| falcon-40b-instruct                           | 63.4 | 61.6 | 84.3 | 55.4 | 52.5 |
+| llama-30b-instruct (***Ours***)                     | 63.2 | 56.7 | 84.0 | 59.0 | 53.1 |
+| llama-65b                                     | 62.1 | 57.6 | 84.3 | 63.4 | 43.0 |
 *Experimental results based on the Open LLM Leaderboard*
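The Average column in the updated table appears to be the plain mean of the four benchmark scores, rounded to one decimal. The sketch below is a sanity check under that assumption, not the leaderboard's own code: the dictionary names and the `check_averages` helper are hypothetical, and the numbers are copied from the rows added in this commit.

```python
# Scores copied from the updated README table: (ARC, HellaSwag, MMLU, TruthfulQA).
SCORES = {
    "llama-65b-instruct": (67.6, 86.5, 64.9, 58.8),
    "llama-30b-instruct-2048": (58.3, 82.5, 61.4, 56.5),
    "falcon-40b-instruct": (61.6, 84.3, 55.4, 52.5),
    "llama-30b-instruct": (56.7, 84.0, 59.0, 53.1),
    "llama-65b": (57.6, 84.3, 63.4, 43.0),
}

# Average column as reported in the same table.
REPORTED_AVERAGE = {
    "llama-65b-instruct": 69.4,
    "llama-30b-instruct-2048": 64.7,
    "falcon-40b-instruct": 63.4,
    "llama-30b-instruct": 63.2,
    "llama-65b": 62.1,
}


def mean_score(scores):
    """Unweighted mean of the benchmark scores (assumed averaging rule)."""
    return sum(scores) / len(scores)


def check_averages(tolerance=0.06):
    """Return rows whose reported average differs from the mean by more than
    `tolerance`; the tolerance allows for one-decimal rounding."""
    mismatches = {}
    for model, scores in SCORES.items():
        computed = mean_score(scores)
        if abs(computed - REPORTED_AVERAGE[model]) > tolerance:
            mismatches[model] = (computed, REPORTED_AVERAGE[model])
    return mismatches


def ranking():
    """Models ordered by reported average, highest first."""
    return sorted(REPORTED_AVERAGE, key=REPORTED_AVERAGE.get, reverse=True)


if __name__ == "__main__":
    print(check_averages())
    print(ranking())
```

An empty result from `check_averages()` means every reported average is consistent with the unweighted mean up to rounding, and `ranking()` should reproduce the row order of the new table.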