storytime-13b / README.md
chargoddard's picture
Adding Evaluation Results (#2)
4f9f2bc
metadata
license: llama2
language:
  - en
tags:
  - llama

Chat model with a storytelling bent.

Recipe:

Responds well to the Alpaca prompt format.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 50.55
ARC (25-shot) 62.03
HellaSwag (10-shot) 83.96
MMLU (5-shot) 57.48
TruthfulQA (0-shot) 52.5
Winogrande (5-shot) 75.53
GSM8K (5-shot) 8.34
DROP (3-shot) 14.0