yangzhao02/mistral-7b-dpo-single_pair
Safetensors
yangzhao02/ListUltraFeedback
mistral
alignment-handbook
ndcg
trl
Generated from Trainer
License: apache-2.0
yangzhao02 committed on Aug 24, 2024
Commit 6ed62f8 • 1 Parent(s): 3890dd2
Training in progress, step 467
Files changed (0)