CalderaAI
/

13B-Thorns-l2

Text Generation

text-generation-inference

Model card Files Files and versions Community

Adding Evaluation Results

#1

by leaderboard-pr-bot - opened Nov 17, 2023

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +14 -1

README.md CHANGED Viewed

@@ -106,4 +106,17 @@ Also thanks to Meta for LLaMAv2 and deciding to allow the research community at
 Each model and LoRA was hand picked and considered for what it could contribute to this ensemble.
 Thanks to each and every one of you for your incredible work developing some of the best things
-to come out of this community.

 Each model and LoRA was hand picked and considered for what it could contribute to this ensemble.
 Thanks to each and every one of you for your incredible work developing some of the best things
+to come out of this community.
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_CalderaAI__13B-Thorns-l2)
+| Metric                | Value                     |
+|-----------------------|---------------------------|
+| Avg.                  | 53.5   |
+| ARC (25-shot)         | 62.88          |
+| HellaSwag (10-shot)   | 83.57    |
+| MMLU (5-shot)         | 56.95         |
+| TruthfulQA (0-shot)   | 49.52   |
+| Winogrande (5-shot)   | 74.51   |
+| GSM8K (5-shot)        | 0.91        |
+| DROP (3-shot)         | 46.13         |