Update README.md

e3fd1af verified 6 days ago

258 Bytes

This is a model released from the preprint: [Triple Preference Optimization: Achieving Better Alignment using a Single Step Optimization](https://arxiv.org/abs/2405.16681). Please refer to our [repository](https://github.com/sahsaeedi/TPO/) for more details.