metadata
license: apache-2.0
datasets:
- Anthropic/hh-rlhf
language:
- en
pipeline_tag: text-generation
The reference model after supervised fine-tuning on the chosen response.
license: apache-2.0
datasets:
- Anthropic/hh-rlhf
language:
- en
pipeline_tag: text-generation
The reference model after supervised fine-tuning on the chosen response.