---
base_model:
- EleutherAI/pythia-160m
---
## Model Description
This is the pythia-160m model from EleutherAI, re-uploaded as an exercise.
## Evaluation Results
As required by the project, we used EleutherAI's lm-evaluation-harness to evaluate pythia-160m on the HellaSwag benchmark.
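
A run like the one described could be reproduced roughly as follows. This is a minimal sketch, assuming lm-evaluation-harness v0.4 or later is installed (`pip install lm-eval`) and using its `simple_evaluate` Python entry point. The helper names `build_eval_kwargs` and `run_eval` and the batch size are illustrative assumptions, not the exact settings used for this card.

```python
def build_eval_kwargs(model_id="EleutherAI/pythia-160m", task="hellaswag",
                      num_fewshot=0, batch_size=8):
    """Collect keyword arguments for lm_eval.simple_evaluate.

    batch_size=8 is an assumed value; the card does not state one.
    """
    return {
        "model": "hf",  # Hugging Face transformers backend
        "model_args": f"pretrained={model_id}",
        "tasks": [task],
        "num_fewshot": num_fewshot,
        "batch_size": batch_size,
    }


def run_eval(**overrides):
    """Run the evaluation; downloads the model and dataset on first use."""
    import lm_eval  # imported lazily so build_eval_kwargs works without it

    results = lm_eval.simple_evaluate(**build_eval_kwargs(**overrides))
    return results["results"]
```

Calling `run_eval()` should return a dictionary keyed by task name, from which the accuracy metrics and their standard errors can be read.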
### HellaSwag
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag| 1|none | 0|acc |↑ |0.2872|± |0.0045|
| | |none | 0|acc_norm|↑ |0.3082|± |0.0046|
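
To put the standard errors in context, the sketch below turns each value in the table into an approximate 95% confidence interval and compares it with random guessing. It assumes a normal approximation (z = 1.96) and uses the fact that each HellaSwag item has four candidate endings, so chance accuracy is 0.25. The function name `confidence_interval` is illustrative.

```python
def confidence_interval(value, stderr, z=1.96):
    """Return the (low, high) bounds of a normal-approximation interval."""
    margin = z * stderr
    return value - margin, value + margin


# Values copied from the table above: metric -> (value, stderr).
results = {"acc": (0.2872, 0.0045), "acc_norm": (0.3082, 0.0046)}

# Four candidate endings per item, so random guessing scores 0.25.
CHANCE = 0.25

intervals = {name: confidence_interval(v, se) for name, (v, se) in results.items()}
for name, (low, high) in intervals.items():
    print(f"{name}: [{low:.4f}, {high:.4f}] above chance: {low > CHANCE}")
```

Under these assumptions the intervals are roughly [0.2784, 0.2960] for `acc` and [0.2992, 0.3172] for `acc_norm`, both above the 0.25 chance level.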