self-correct_ministral8Bit_mMQA_dpo_iter2 / model-00002-of-00004.safetensors

Commit History

Training in progress, step 51
51dc796
verified

RyanYr committed on

Training in progress, step 48
827856e
verified

RyanYr committed on

Training in progress, step 42
b07071b
verified

RyanYr committed on

Training in progress, step 36
e13a2ab
verified

RyanYr committed on

Training in progress, step 30
9fb6ffa
verified

RyanYr committed on

Training in progress, step 24
191e247
verified

RyanYr committed on

Training in progress, step 18
5f682ac
verified

RyanYr committed on

Training in progress, step 12
12289ad
verified

RyanYr committed on

Training in progress, step 6
a93a1f4
verified

RyanYr committed on