yangzhao02/mistral-7b-dpo-single_pair
Safetensors
yangzhao02/ListUltraFeedback
mistral
alignment-handbook
ndcg
trl
Generated from Trainer
License: apache-2.0
Commit History (branch: main)
End of training (29bd56f, verified): yangzhao02 committed on Aug 24, 2024
Model save (7556ac8, verified): yangzhao02 committed on Aug 24, 2024
Training in progress, step 467 (6ed62f8, verified): yangzhao02 committed on Aug 24, 2024
Training in progress, step 375 (3890dd2, verified): yangzhao02 committed on Aug 24, 2024
Training in progress, step 250 (f287337, verified): yangzhao02 committed on Aug 24, 2024
Training in progress, step 125 (ac97467, verified): yangzhao02 committed on Aug 24, 2024
initial commit (d9e6198, verified): yangzhao02 committed on Aug 24, 2024