
Merge of Enterredaas-33b QLoRA

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 50.52 |
| ARC (25-shot) | 60.92 |
| HellaSwag (10-shot) | 84.18 |
| MMLU (5-shot) | 58.3 |
| TruthfulQA (0-shot) | 49.02 |
| Winogrande (5-shot) | 78.77 |
| GSM8K (5-shot) | 16.22 |
| DROP (3-shot) | 6.23 |
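
The `Avg.` row appears to be the unweighted mean of the seven benchmark scores listed with it. The sketch below assumes that and recomputes it from the reported values; the `scores` and `average` names are illustrative only and not part of any leaderboard tooling.

```python
# Benchmark scores copied from the results listed above.
scores = {
    "ARC (25-shot)": 60.92,
    "HellaSwag (10-shot)": 84.18,
    "MMLU (5-shot)": 58.3,
    "TruthfulQA (0-shot)": 49.02,
    "Winogrande (5-shot)": 78.77,
    "GSM8K (5-shot)": 16.22,
    "DROP (3-shot)": 6.23,
}

# Assumption: "Avg." is a plain arithmetic mean with equal weight per benchmark.
average = sum(scores.values()) / len(scores)

print(f"Avg. {average:.2f}")  # prints: Avg. 50.52
```

Because every benchmark carries equal weight under this assumption, the low GSM8K and DROP scores pull the average well below the ARC, HellaSwag, MMLU, and Winogrande results.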