qwen2-0.5b-hh-sft / train_results.json
Commit e4ad68c (verified) by yangzhao02: "Model save"
{
  "epoch": 1.0,
  "total_flos": 27695775744.0,
  "train_loss": 1.5830808877944946,
  "train_runtime": 12.0584,
  "train_samples": 500,
  "train_samples_per_second": 6.551,
  "train_steps_per_second": 0.083
}
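
The throughput fields can be cross-checked against the runtime. The following is a minimal sketch, not part of the repository: it multiplies each per-second rate by `train_runtime` to back out the approximate number of optimizer steps and samples the rates imply. The helper name `implied_counts` is hypothetical, and because the rates are rounded to three decimals the results are only approximate.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
  "epoch": 1.0,
  "total_flos": 27695775744.0,
  "train_loss": 1.5830808877944946,
  "train_runtime": 12.0584,
  "train_samples": 500,
  "train_samples_per_second": 6.551,
  "train_steps_per_second": 0.083
}
"""


def implied_counts(metrics):
    """Hypothetical helper: back out counts as rate * runtime (seconds)."""
    runtime = metrics["train_runtime"]
    return {
        "steps": metrics["train_steps_per_second"] * runtime,
        "samples": metrics["train_samples_per_second"] * runtime,
    }


metrics = json.loads(RAW)
counts = implied_counts(metrics)

# Rates are rounded in the file, so round the products to whole counts.
print("implied steps:  ", round(counts["steps"]))
print("implied samples:", round(counts["samples"]))
print("reported train_samples:", metrics["train_samples"])
```

Under these numbers the rates imply roughly 1 step and roughly 79 samples over the 12.0584 s run, while `train_samples` reports 500. The file alone does not say why the two sample counts differ, so that gap is best checked against the training configuration rather than inferred here.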