Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
honggen
/
hard_dpo
like
0
Text Generation
Anthropic/hh-rlhf
English
License:
apache-2.0
Model card
Files
Files and versions
Community
178c201
hard_dpo
Commit History
Upload policy.pt
178c201
verified
honggen
commited on
Mar 7, 2024
initial commit
364a4b4
verified
honggen
commited on
Mar 3, 2024