File size: 1,069 Bytes
9c862b6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/31gbxj2w

Eval Results:
|    Tasks     |Version|Filter|  Metric  |Value |   |Stderr|
|--------------|-------|------|----------|-----:|---|-----:|
|arc_challenge |Yaml   |none  |acc       |0.2159|±  |0.0120|
|              |       |none  |acc_norm  |0.2295|±  |0.0123|
|arc_easy      |Yaml   |none  |acc       |0.3266|±  |0.0096|
|              |       |none  |acc_norm  |0.3287|±  |0.0096|
|lambada_openai|Yaml   |none  |perplexity|   NaN|±  |   NaN|
|              |       |none  |acc       |0.1750|±  |0.0053|
|logiqa        |Yaml   |none  |acc       |0.2028|±  |0.0158|
|              |       |none  |acc_norm  |0.2028|±  |0.0158|
|piqa          |Yaml   |none  |acc       |0.5441|±  |0.0116|
|              |       |none  |acc_norm  |0.5446|±  |0.0116|
|sciq          |Yaml   |none  |acc       |0.2050|±  |0.0128|
|              |       |none  |acc_norm  |0.1940|±  |0.0125|
|winogrande    |Yaml   |none  |acc       |0.5043|±  |0.0141|
|wsc           |Yaml   |none  |acc       |0.6154|±  |0.0479|