File size: 1,068 Bytes
978cc7b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/gy2g8jj1

Model Evals:
|    Tasks     |Version|Filter|  Metric  |Value |   |Stderr|
|--------------|-------|------|----------|-----:|---|-----:|
|arc_challenge |Yaml   |none  |acc       |0.2253|±  |0.0122|
|              |       |none  |acc_norm  |0.2278|±  |0.0123|
|arc_easy      |Yaml   |none  |acc       |0.2551|±  |0.0089|
|              |       |none  |acc_norm  |0.2567|±  |0.0090|
|lambada_openai|Yaml   |none  |perplexity|   NaN|±  |   NaN|
|              |       |none  |acc       |0.0016|±  |0.0005|
|logiqa        |Yaml   |none  |acc       |0.2028|±  |0.0158|
|              |       |none  |acc_norm  |0.2028|±  |0.0158|
|piqa          |Yaml   |none  |acc       |0.4946|±  |0.0117|
|              |       |none  |acc_norm  |0.4924|±  |0.0117|
|sciq          |Yaml   |none  |acc       |0.0140|±  |0.0037|
|              |       |none  |acc_norm  |0.0140|±  |0.0037|
|winogrande    |Yaml   |none  |acc       |0.5036|±  |0.0141|
|wsc           |Yaml   |none  |acc       |0.6346|±  |0.0474|