Enhancing Diffusion Models with Text-Encoder Reinforcement Learning Paper • 2311.15657 • Published Nov 27, 2023 • 2
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models Paper • 2402.08714 • Published Feb 13 • 10
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning Paper • 2402.06102 • Published Feb 8 • 4
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models Paper • 2402.01118 • Published Feb 2 • 29
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Paper • 2402.10210 • Published Feb 15 • 29