---
base_model:
- EleutherAI/pythia-160m
---

## Model Description

This is EleutherAI's pythia-160m model, re-uploaded as an exercise.

## Evaluation Results

As required by the project, we used EleutherAI's lm-evaluation-harness to evaluate pythia-160m on the HellaSwag benchmark.
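The exact command used for this run is not recorded here. As a rough sketch, a zero-shot evaluation of this kind could be launched through the harness's Python API (`lm_eval.simple_evaluate`, available in lm-evaluation-harness v0.4 and later). The helper names and the `batch_size` value below are illustrative assumptions, not settings taken from the original run.

```python
def build_eval_kwargs(batch_size=8):
    """Build keyword arguments for lm_eval.simple_evaluate.

    batch_size is an illustrative assumption; the original run's value is unknown.
    """
    return {
        "model": "hf",  # Hugging Face transformers backend
        "model_args": "pretrained=EleutherAI/pythia-160m",
        "tasks": ["hellaswag"],
        "num_fewshot": 0,  # matches the n-shot column of the results table
        "batch_size": batch_size,
    }


def run_hellaswag_eval():
    """Run the evaluation and return the per-task metrics for hellaswag."""
    # Imported lazily so this file can be loaded without lm-eval installed.
    import lm_eval

    results = lm_eval.simple_evaluate(**build_eval_kwargs())
    return results["results"]["hellaswag"]
```

Calling `run_hellaswag_eval()` downloads the model and the dataset, so it needs network access and `pip install lm-eval` beforehand.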

### Hellaswag

|  Tasks  |Version|Filter|n-shot| Metric |   |Value |   |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag|      1|none  |     0|acc     |↑  |0.2872|±  |0.0045|
|         |       |none  |     0|acc_norm|↑  |0.3082|±  |0.0046|
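
To help read these numbers, the short sketch below turns each value and its standard error into an approximate 95% confidence interval and compares it with random guessing. It assumes a chance accuracy of 0.25 (four candidate endings per item) and a validation split of 10,042 examples; neither figure is stated in this card, and the function names are illustrative.

```python
import math

# Values copied from the results table: metric -> (value, stderr).
RESULTS = {"acc": (0.2872, 0.0045), "acc_norm": (0.3082, 0.0046)}

CHANCE = 0.25       # assumption: four candidate endings per HellaSwag item
N_EXAMPLES = 10042  # assumption: size of the HellaSwag validation split


def confidence_interval(value, stderr, z=1.96):
    """Normal-approximation interval: value +/- z * stderr."""
    return value - z * stderr, value + z * stderr


def binomial_stderr(p, n):
    """Standard error of a proportion p estimated from n independent items."""
    return math.sqrt(p * (1 - p) / n)


for metric, (value, stderr) in RESULTS.items():
    low, high = confidence_interval(value, stderr)
    print(f"{metric}: {value:.4f} (95% CI {low:.4f} to {high:.4f}), "
          f"above chance: {low > CHANCE}, "
          f"binomial stderr: {binomial_stderr(value, N_EXAMPLES):.4f}")
```

Under these assumptions, both intervals lie above 0.25, and the binomial standard error matches the reported Stderr column to four decimal places.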