self-correct_mistral-small-it_mMQA_dpo_iter2_ref-iter1 / model-00006-of-00009.safetensors

Commit History

Training in progress, step 25
ddf28b9
verified

RyanYr commited on