nash_dpo_rank4_iter_real_plus_3 / train_results.json

Commit History

Model save
d8c02b1
verified

YYYYYYibo commited on