Cameron-Chen's picture
Update README.md
661981a verified
|
raw
history blame
336 Bytes
metadata
library_name: transformers
license: mit
datasets:
  - HuggingFaceH4/ultrafeedback_binarized
language:
  - en

This is a model released from the preprint: Bootstrapping Language Models with DPO Implicit Rewards. Please refer to our repository for more details.