On the Exploitability of Reinforcement Learning with Human Feedback for Large Language Models • Paper • 2311.09641 • Published Nov 16, 2023