zephyr-7b-dpo-qlora-min-pi-part-0 / train_results.json
{
    "epoch": 1.0,
    "train_loss": 0.6650766546909626,
    "train_runtime": 4758.2952,
    "train_samples": 10000,
    "train_samples_per_second": 2.102,
    "train_steps_per_second": 0.008
}