Text Generation
Safetensors
English
llama
shining-valiant
shining-valiant-2
valiant
valiant-labs
llama-3.2
llama-3.2-instruct
llama-3.2-instruct-3b
llama-3
llama-3-instruct
llama-3-instruct-3b
3b
science
physics
biology
chemistry
compsci
computer-science
engineering
technical
conversational
chat
instruct
Eval Results
eval format
README.md
CHANGED
@@ -202,16 +202,3 @@ We care about open source.
 For everyone to use.
 
 We encourage others to finetune further from our models.
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ValiantLabs__Llama3.2-3B-ShiningValiant2)
-
-| Metric |Value|
-|-------------------|----:|
-|Avg. |17.42|
-|IFEval (0-Shot) |49.12|
-|BBH (3-Shot) |19.03|
-|MATH Lvl 5 (4-Shot)| 9.52|
-|GPQA (0-shot) | 3.02|
-|MuSR (0-shot) | 4.72|
-|MMLU-PRO (5-shot) |19.09|
-