Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
honggen
/
hard_dpo
like
0
Text Generation
Anthropic/hh-rlhf
English
License:
apache-2.0
Model card
Files
Files and versions
Community
main
hard_dpo
Commit History
Create README.md
bd014a4
verified
honggen
commited on
Mar 7, 2024
Upload policy.pt
178c201
verified
honggen
commited on
Mar 7, 2024
initial commit
364a4b4
verified
honggen
commited on
Mar 3, 2024