DamiFass
/

pythia-160m-Project-week1

Model card Files Files and versions Community

pythia-160m-Project-week1 / README.md

DamiFass's picture

Create README file

d414f4b verified 26 days ago

|

576 Bytes

	---
	base_model:
	- EleutherAI/pythia-160m
	---

	## Model Description

	This is the pythia-160m from EleutherAI re-uploaded as an exercise.

	## Evaluation Results

	According to project requirement, we used lm-evalutation-harness from EleutherAI to evaluate pythia-160m on the 'Hellaswag' benchmark.

	### Hellaswag

	\| Tasks \|Version\|Filter\|n-shot\| Metric \| \|Value \| \|Stderr\|
	\|---------\|------:\|------\|-----:\|--------\|---\|-----:\|---\|-----:\|
	\|hellaswag\| 1\|none \| 0\|acc \|↑ \|0.2872\|± \|0.0045\|
	\| \| \|none \| 0\|acc_norm\|↑ \|0.3082\|± \|0.0046\|