tFINE-850m-24x24-instruct-L2 / train_results.json
End of training
59876ca verified
280 Bytes
{
    "epoch": 1.0,
    "num_input_tokens_seen": 750938410,
    "total_flos": 3.6248418467253043e+18,
    "train_loss": 1.2696220230851407,
    "train_runtime": 79988.0702,
    "train_samples": 1013227,
    "train_samples_per_second": 12.667,
    "train_steps_per_second": 0.099
}
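
The raw fields above can be combined into a few derived figures that are easier to read, such as wall-clock hours and token throughput. The following is a minimal sketch, not part of the original file. It assumes `train_runtime` is in seconds, and that with `epoch` at 1.0 each of the `train_samples` was seen once. The helper name `derive` and the output keys are hypothetical. Note that `num_input_tokens_seen` may include padding tokens, and `train_steps_per_second` is rounded to three decimals, so the per-sample and per-step figures are only approximate.

```python
import json

# Contents of train_results.json, copied verbatim from the file above.
RAW = """
{
    "epoch": 1.0,
    "num_input_tokens_seen": 750938410,
    "total_flos": 3.6248418467253043e+18,
    "train_loss": 1.2696220230851407,
    "train_runtime": 79988.0702,
    "train_samples": 1013227,
    "train_samples_per_second": 12.667,
    "train_steps_per_second": 0.099
}
"""


def derive(results: dict) -> dict:
    """Compute derived metrics (hypothetical helper, names are illustrative)."""
    runtime = results["train_runtime"]  # assumed to be seconds
    tokens = results["num_input_tokens_seen"]
    samples = results["train_samples"]
    return {
        "runtime_hours": runtime / 3600,
        "tokens_per_second": tokens / runtime,
        "samples_per_second": samples / runtime,
        # Approximate: assumes one pass over the data; may count padding.
        "avg_tokens_per_sample": tokens / samples,
        # Approximate: ratio of two rounded rates, so only a rough
        # estimate of the effective batch size.
        "approx_samples_per_step": (
            results["train_samples_per_second"]
            / results["train_steps_per_second"]
        ),
        "flos_per_token": results["total_flos"] / tokens,
    }


results = json.loads(RAW)
derived = derive(results)

for name, value in derived.items():
    print(f"{name}: {value:,.3f}")
```

Recomputing `samples_per_second` from `train_samples` and `train_runtime` is a quick consistency check against the reported `train_samples_per_second`.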