MM-IFEngine: Towards Multimodal Instruction Following Paper • 2504.07957 • Published 10 days ago • 33
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography Paper • 2504.07083 • Published 11 days ago • 22
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 118
RelightVid: Temporal-Consistent Diffusion Model for Video Relighting Paper • 2501.16330 • Published Jan 27 • 2
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference Paper • 2502.18411 • Published Feb 25 • 73
RelightVid: Temporal-Consistent Diffusion Model for Video Relighting Paper • 2501.16330 • Published Jan 27 • 2