bhenrym14 committed
Commit d6d4c42
Parent: 058c342

Update README.md

Files changed (1): README.md (+1, -1)
README.md CHANGED
@@ -43,7 +43,7 @@ Previous experiments have demonstrated that orca-like datasets yield substantial
 | 2048 | 5.22 | 5.38 | 5.87 | 5.23 | **5.07** |
 | 4096 | 4.90 | 5.08 | 5.50 | 4.91 | **4.77** |
 | 8192 | **4.71** | 4.90 | 5.32 | Not Tested | 57.1 |
-| 12000 | 30 | **4.82** | 56.1 | Not Tested | Not Tested |
+| 12000 | 55 | **4.82** | 56.1 | Not Tested | Not Tested |
 
 - This model is very competitive with the Llama-1 33b extended context variants. In fact, it outperforms bhenrym14/airoboros-33b-gpt4-1.4.1-lxctx-PI-16384-fp16 everywhere <=8192 tokens.
 - Not presented here, but this model outperforms the base llama-2-13b on MMLU-fs with a score of 58.3. If this score ends up being be replicated on the HF LLM leaderboard, **this would place this model at 2nd or 3rd overall for MMLU among 13b models (and the #1 for extended context)**
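
For context on the kind of numbers being corrected here, below is a minimal sketch of how perplexity at a fixed context length is commonly estimated. The diff does not state the author's actual evaluation procedure, so the model identifier, evaluation text, and non-overlapping windowing are illustrative assumptions only.

```python
# Hypothetical sketch: chunked perplexity at a fixed context length.
# The model id, text source, and windowing scheme are assumptions for
# illustration, not the method used to produce the table above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your/extended-context-model"  # placeholder; substitute the model under test
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)
model.eval()

def perplexity_at(text: str, ctx_len: int) -> float:
    """Average cross-entropy over non-overlapping windows of ctx_len tokens,
    then exponentiate to get perplexity."""
    ids = tokenizer(text, return_tensors="pt").input_ids[0]
    losses = []
    for start in range(0, ids.size(0) - ctx_len + 1, ctx_len):
        window = ids[start : start + ctx_len].unsqueeze(0).to(model.device)
        with torch.no_grad():
            out = model(window, labels=window)  # causal LM loss over the window
        losses.append(out.loss.float())
    return torch.exp(torch.stack(losses).mean()).item()

# Example: evaluate at the context lengths listed in the table.
# for ctx in (2048, 4096, 8192, 12000):
#     print(ctx, perplexity_at(long_eval_text, ctx))
```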