hZzy/qwen2.5-0.5b-expo-DPO-ES-TRY2
Tags: Safetensors, hZzy/train_pairwise, qwen2, alignment-handbook, ndcg, trl, expo, Generated from Trainer
License: apache-2.0
Branch: main
Path: qwen2.5-0.5b-expo-DPO-ES-TRY2/model.safetensors
Commit History
7f977a6  Model save  (verified; hZzy committed about 1 month ago)
f506e6f  Training in progress, step 528  (verified; hZzy committed about 1 month ago)
3d50bcf  Training in progress, step 477  (verified; hZzy committed about 1 month ago)
5d9a2b9  Training in progress, step 424  (verified; hZzy committed about 1 month ago)
47b73d0  Training in progress, step 371  (verified; hZzy committed about 1 month ago)
687443f  Training in progress, step 318  (verified; hZzy committed about 1 month ago)
a4a91a5  Training in progress, step 265  (verified; hZzy committed about 1 month ago)
54e93f8  Training in progress, step 212  (verified; hZzy committed about 1 month ago)
e8e2d15  Training in progress, step 159  (verified; hZzy committed about 1 month ago)
c777048  Training in progress, step 106  (verified; hZzy committed about 1 month ago)
10cddb7  Training in progress, step 53  (verified; hZzy committed about 1 month ago)