self-correct_mistral-small-it_mMQA_dpo_iter3_beta.05 / model-00002-of-00009.safetensors

Commit History

Training in progress, step 26
3e569a7
verified

RyanYr committed on