README.md · DamiFass/pythia-160m-Project-week1 at main

metadata

base_model:
  - EleutherAI/pythia-160m

Model Description

This is the pythia-160m from EleutherAI re-uploaded as an exercise.

According to project requirement, we used lm-evalutation-harness from EleutherAI to evaluate pythia-160m on the 'Hellaswag' benchmark.

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
hellaswag	1	none	0	acc	↑	0.2872	±	0.0045
		none	0	acc_norm	↑	0.3082	±	0.0046