ul_llm_week1 / README.md
tomhata's picture
Update README.md
1e35de5 verified
|
raw
history blame
368 Bytes
---
base_model:
- EleutherAI/pythia-160m
---
# Pythia-160M
Evaluated with Eleuther Evaluation Harness
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag| 1|none | 0|acc |↑ |0.2872|± |0.0045|
| | |none | 0|acc_norm|↑ |0.3082|± |0.0046|