Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
hZzy
/
qwen2.5-0.5b-expo-DPO-ES-TRY
like
0
Safetensors
hZzy/train_pairwise
qwen2
alignment-handbook
ndcg
trl
expo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
86daf58
qwen2.5-0.5b-expo-DPO-ES-TRY
Commit History
Training in progress, step 450
86daf58
verified
hZzy
commited on
about 1 month ago
Training in progress, step 400
ba40357
verified
hZzy
commited on
about 1 month ago
Training in progress, step 350
6f06f54
verified
hZzy
commited on
about 1 month ago
Training in progress, step 300
9370d29
verified
hZzy
commited on
about 1 month ago
Training in progress, step 250
6bbec9b
verified
hZzy
commited on
about 1 month ago
Training in progress, step 200
a4c1e66
verified
hZzy
commited on
about 1 month ago
Training in progress, step 150
224e296
verified
hZzy
commited on
about 1 month ago
Training in progress, step 100
e29b732
verified
hZzy
commited on
about 1 month ago
Training in progress, step 50
0aa1c8b
verified
hZzy
commited on
about 1 month ago
initial commit
0ae04cd
verified
hZzy
commited on
about 1 month ago