hZzy
/

qwen2.5-0.5b-expo-DPO-ES-TRY2

alignment-handbook

Generated from Trainer

Model card Files Files and versions Community

qwen2.5-0.5b-expo-DPO-ES-TRY2 / model.safetensors

Commit History

Model save

7f977a6
verified

hZzy commited on about 1 month ago

Training in progress, step 528

f506e6f
verified

hZzy commited on about 1 month ago

Training in progress, step 477

3d50bcf
verified

hZzy commited on about 1 month ago

Training in progress, step 424

5d9a2b9
verified

hZzy commited on about 1 month ago

Training in progress, step 371

47b73d0
verified

hZzy commited on about 1 month ago

Training in progress, step 318

687443f
verified

hZzy commited on about 1 month ago

Training in progress, step 265

a4a91a5
verified

hZzy commited on about 1 month ago

Training in progress, step 212

54e93f8
verified

hZzy commited on about 1 month ago

Training in progress, step 159

e8e2d15
verified

hZzy commited on about 1 month ago

Training in progress, step 106

c777048
verified

hZzy commited on about 1 month ago

Training in progress, step 53

10cddb7
verified

hZzy commited on about 1 month ago