---
library_name: transformers
license: mit
datasets:
- HuggingFaceH4/ultrafeedback_binarized
language:
- en
---
This model was released with the preprint "Bootstrapping Language Models with DPO Implicit Rewards". Please refer to our repository for more details.