From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space Paper • 2604.14142 • Published 2 days ago • 23
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments Paper • 2604.14144 • Published 2 days ago • 60
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published 9 days ago • 105
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 2 days ago • 110
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 3 days ago • 71
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators Paper • 2604.11805 • Published 4 days ago • 16
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper • 2604.08995 • Published 7 days ago • 44
ViVa: A Video-Generative Value Model for Robot Reinforcement Learning Paper • 2604.08168 • Published 8 days ago • 17
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 9 days ago • 34
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published 9 days ago • 35
Action Images: End-to-End Policy Learning via Multiview Video Generation Paper • 2604.06168 • Published 10 days ago • 14
EgoSim: Egocentric World Simulator for Embodied Interaction Generation Paper • 2604.01001 • Published 16 days ago • 37
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 15 days ago • 93
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published 16 days ago • 48
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 19 days ago • 143