mistralit2_1000_STEPS_1e6_05_beta_DPO / final_checkpoint /model-00003-of-00003.safetensors

Commit History

End of training
b4e83b8
verified

tsavage68 commited on