self-correct_mistral-small-it_mMQA_dpo_iter2_beta.05 / model-00008-of-00009.safetensors

Commit History

Training in progress, step 25
c605c40
verified

RyanYr committed on