leaderboard-pr-bot's picture
Adding Evaluation Results
f910368
|
raw
history blame
1.8 kB
metadata
license: cc-by-nc-4.0
datasets:
  - Open-Orca/OpenOrca
language:
  - en

Buy Me A Coffee

Merge of Marcoroni-13B and Luban-13B using ties merge.

Weights

Density

Evaluation Results (Open LLM Leaderboard)

Metric Value
Avg. 65.21
ARC (25-shot) 63.65
HellaSwag (10-shot) 82.92
MMLU (5-shot) 58.70
TruthfulQA (0-shot) 55.55

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 51.16
ARC (25-shot) 63.65
HellaSwag (10-shot) 82.92
MMLU (5-shot) 58.7
TruthfulQA (0-shot) 55.55
Winogrande (5-shot) 77.03
GSM8K (5-shot) 10.01
DROP (3-shot) 10.25