leaderboard-pr-bot committed on
Commit
e897790
1 Parent(s): 5af92a0

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -313,3 +313,17 @@ Furthermore, some aspects of string theory suggest that the fundamental constitu
  In summary, while there is no direct connection between plasma propulsion systems and string theory, there is an indirect connection through the use of the equations of classical electromagnetism, which are also used in string theory. Additionally, some aspects of string theory suggest that the fundamental constituents of matter may have additional properties beyond those described by classical physics.
  ```
 
+
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_migtissera__Synthia-70B-v1.2b)
+
+ | Metric                | Value                     |
+ |-----------------------|---------------------------|
+ | Avg.                  | 64.63                     |
+ | ARC (25-shot)         | 68.77                     |
+ | HellaSwag (10-shot)   | 87.57                     |
+ | MMLU (5-shot)         | 68.81                     |
+ | TruthfulQA (0-shot)   | 57.69                     |
+ | Winogrande (5-shot)   | 83.9                      |
+ | GSM8K (5-shot)        | 35.25                     |
+ | DROP (3-shot)         | 50.41                     |
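
The PR does not say how the `Avg.` row is derived. A minimal sketch, assuming it is the unweighted mean of the seven benchmark scores in the table, rounded to two decimals (the helper name `leaderboard_average` is hypothetical and not part of any leaderboard tooling):

```python
# Scores copied from the table in this PR.
# Assumption: "Avg." is the plain arithmetic mean of these seven values.
scores = {
    "ARC (25-shot)": 68.77,
    "HellaSwag (10-shot)": 87.57,
    "MMLU (5-shot)": 68.81,
    "TruthfulQA (0-shot)": 57.69,
    "Winogrande (5-shot)": 83.9,
    "GSM8K (5-shot)": 35.25,
    "DROP (3-shot)": 50.41,
}


def leaderboard_average(values):
    """Return the unweighted mean of the given scores, rounded to two decimals."""
    values = list(values)
    return round(sum(values) / len(values), 2)


average = leaderboard_average(scores.values())
print(f"Avg. = {average}")
```

Under that assumption the computed value agrees with the 64.63 reported in the `Avg.` row, which is a quick way to sanity-check the table before merging.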