--- base_model: - EleutherAI/pythia-160m --- ## Model Description This is the pythia-160m from EleutherAI re-uploaded as an exercise. ## Evaluation Results According to project requirement, we used lm-evalutation-harness from EleutherAI to evaluate pythia-160m on the 'Hellaswag' benchmark. ### Hellaswag | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr| |---------|------:|------|-----:|--------|---|-----:|---|-----:| |hellaswag| 1|none | 0|acc |↑ |0.2872|± |0.0045| | | |none | 0|acc_norm|↑ |0.3082|± |0.0046|