File size: 576 Bytes
d414f4b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
---
base_model:
- EleutherAI/pythia-160m
---
## Model Description
This is the pythia-160m from EleutherAI re-uploaded as an exercise.
## Evaluation Results
According to project requirement, we used lm-evalutation-harness from EleutherAI to evaluate pythia-160m on the 'Hellaswag' benchmark.
### Hellaswag
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag| 1|none | 0|acc |↑ |0.2872|± |0.0045|
| | |none | 0|acc_norm|↑ |0.3082|± |0.0046| |