Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Paper • 2603.12247 • Published 8 days ago • 23
MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization Paper • 2510.08540 • Published Oct 9, 2025 • 110
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding Paper • 2411.03628 • Published Nov 6, 2024 • 2