Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
hZzy
/
qwen2.5-0.5b-expo-DPO-ES-TRY1
like
0
Safetensors
hZzy/train_pairwise
qwen2
alignment-handbook
ndcg
trl
expo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
main
qwen2.5-0.5b-expo-DPO-ES-TRY1
Commit History
End of training
63d5d66
verified
hZzy
commited on
29 days ago
Model save
10246fa
verified
hZzy
commited on
29 days ago
Training in progress, step 324
8e7aec6
verified
hZzy
commited on
29 days ago
Training in progress, step 288
e91b2de
verified
hZzy
commited on
29 days ago
Training in progress, step 252
4f2f4f1
verified
hZzy
commited on
30 days ago
Training in progress, step 216
a3b102f
verified
hZzy
commited on
30 days ago
Training in progress, step 180
d5a588a
verified
hZzy
commited on
30 days ago
initial commit
2fc2b9c
verified
hZzy
commited on
30 days ago