mamba-gpt-3b-v2 is the Best 3B Model! Surpassing dolly-v2-12b
#137, opened by CobraMamba
mamba-gpt-3b-v2 is the best 3B model on the Open LLM Leaderboard, and its performance surpasses that of dolly-v2-12b.
Metric | Value |
---|---|
MMLU (5-shot) | 27.1 |
ARC (25-shot) | 42.2 |
HellaSwag (10-shot) | 71.5 |
TruthfulQA (0-shot) | 36.7 |
Avg. | 44.4 |
We used the state-of-the-art Language Model Evaluation Harness to run the benchmarks above.
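
As a quick sanity check on the table, the Avg. row matches what you get from averaging the four scores. This is only a sketch: it assumes Avg. is the plain unweighted mean of the four benchmarks, and the `scores` dictionary is just an illustrative name for the numbers copied from the table.

```python
# Scores copied from the table in this post.
scores = {
    "MMLU (5-shot)": 27.1,
    "ARC (25-shot)": 42.2,
    "HellaSwag (10-shot)": 71.5,
    "TruthfulQA (0-shot)": 36.7,
}

# Assumption: "Avg." is the unweighted arithmetic mean of the four scores.
average = sum(scores.values()) / len(scores)

# Rounded to one decimal place, this gives the 44.4 reported in the table.
print(round(average, 1))
```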
clefourrier changed discussion status to closed