CreativityEngine
license: cc-by-nc-4.0

Merge of the creative adapter from airoboros-lmoe-13b-2.1 (https://huggingface.co/jondurbin/airoboros-lmoe-13b-2.1/tree/main/adapters/creative) with chronos-13b-v2 (https://huggingface.co/elinas/chronos-13b-v2), weight: 0.38
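
This README does not say how the 0.38 weight was applied. One common reading, assumed here, is that a low-rank adapter's weight delta is scaled by that factor before being added to the base model's weights. The sketch below shows only that arithmetic on toy NumPy matrices; `merge_scaled_delta`, the matrix shapes, and the random values are hypothetical and are not the procedure or tooling actually used for this model.

```python
import numpy as np


def merge_scaled_delta(base, lora_a, lora_b, weight):
    """Return base + weight * (lora_b @ lora_a).

    Assumption: the adapter is a low-rank pair (lora_b, lora_a) whose
    product is a weight delta, scaled by `weight` before merging.
    """
    return base + weight * (lora_b @ lora_a)


# Toy stand-ins for one layer's weights (hypothetical shapes and values).
rng = np.random.default_rng(0)
base = rng.normal(size=(8, 8))    # base model weight matrix
lora_a = rng.normal(size=(2, 8))  # adapter down-projection, rank 2
lora_b = rng.normal(size=(8, 2))  # adapter up-projection, rank 2

merged = merge_scaled_delta(base, lora_a, lora_b, weight=0.38)
print(merged.shape)
```

With `weight=0.0` the result equals the base weights, and with `weight=1.0` the adapter delta is applied at full strength, so under this assumption 0.38 would sit between the two.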

For Dampf

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 52.07 |
| ARC (25-shot) | 59.3 |
| HellaSwag (10-shot) | 82.42 |
| MMLU (5-shot) | 53.55 |
| TruthfulQA (0-shot) | 52.46 |
| Winogrande (5-shot) | 74.19 |
| GSM8K (5-shot) | 9.55 |
| DROP (3-shot) | 32.98 |
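
As a quick sanity check on the figures above, the short sketch below recomputes the average, assuming that Avg. is the unweighted mean of the seven benchmark scores (the README does not state this). The recomputed value comes within 0.01 of the reported 52.07; the small gap is presumably because the per-benchmark values shown here are already rounded.

```python
# Benchmark scores copied from the results listed above.
scores = {
    "ARC (25-shot)": 59.3,
    "HellaSwag (10-shot)": 82.42,
    "MMLU (5-shot)": 53.55,
    "TruthfulQA (0-shot)": 52.46,
    "Winogrande (5-shot)": 74.19,
    "GSM8K (5-shot)": 9.55,
    "DROP (3-shot)": 32.98,
}

# Assumption: Avg. is the plain (unweighted) mean of the seven scores.
average = sum(scores.values()) / len(scores)
print(f"Recomputed average: {average:.2f}")
```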