smolm-autoreg-bpe-seed_555 / train_results.json
kanishka's picture
End of training
67d5a5d
raw
history blame
197 Bytes
{
"epoch": 10.0,
"train_loss": 2.5528629182903297,
"train_runtime": 747.7642,
"train_samples": 52812,
"train_samples_per_second": 706.265,
"train_steps_per_second": 11.046
}