Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
zwangSalesforce
/
online_dpo
like
0
TensorBoard
Safetensors
gpt_neox
trl
online-dpo
Generated from Trainer
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
online_dpo
Commit History
Model save
ff12045
verified
zwangSalesforce
commited on
Aug 20, 2024
Model save
e3adc52
verified
zwangSalesforce
commited on
Aug 20, 2024
initial commit
77357ee
verified
zwangSalesforce
commited on
Aug 20, 2024