Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
honggen
/
hard_dpo
like
0
Text Generation
Anthropic/hh-rlhf
English
License:
apache-2.0
Model card
Files
Files and versions
Community
honggen
commited on
Mar 7, 2024
Commit
bd014a4
·
verified
·
1 Parent(s):
178c201
Create README.md
Browse files
Files changed (1)
hide
show
README.md
+10
-0
README.md
ADDED
Viewed
@@ -0,0 +1,10 @@
1
+
---
2
+
license: apache-2.0
3
+
datasets:
4
+
- Anthropic/hh-rlhf
5
+
language:
6
+
- en
7
+
pipeline_tag: text-generation
8
+
---
9
+
10
+
The reference model after supervised fine-tuning on the chosen response.