self-correct_ministral8Bit_mMQA_dpo_iter1 / model-00003-of-00009.safetensors

Commit History

Training in progress, step 14
fe8115f
verified

RyanYr committed on