RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment • Paper • 2307.12950 • Published Jul 24, 2023
sam-paech/gutenberg3-generalfiction-scifi-fantasy-romance-adventure-dpo • Updated Oct 23, 2024