Mistral-7B-TPO-40k / README.md
This model was released with the preprint [Triple Preference Optimization: Achieving Better Alignment using a Single Step Optimization](https://arxiv.org/abs/2405.16681). Please refer to our [repository](https://github.com/sahsaeedi/TPO/) for more details.