smolm-autoreg-bpe-seed_555 / train_results.json
kanishka's picture
End of training
17a5883
raw
history blame
195 Bytes
{
"epoch": 10.0,
"train_loss": 2.546341962098498,
"train_runtime": 742.1102,
"train_samples": 52812,
"train_samples_per_second": 711.646,
"train_steps_per_second": 11.13
}