WorldCache: Accelerating World Models for Free via Heterogeneous Token Caching Paper • 2603.06331 • Published 5 days ago • 3
EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation Paper • 2603.06014 • Published 5 days ago • 7
RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies Paper • 2603.04639 • Published 6 days ago • 20
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 11 days ago • 29
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning Paper • 2603.04918 • Published 6 days ago • 54
Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling Paper • 2603.04553 • Published 6 days ago • 3
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline Paper • 2603.05484 • Published 5 days ago • 4
UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data Paper • 2603.05312 • Published 5 days ago • 7
Large Multimodal Models as General In-Context Classifiers Paper • 2602.23229 • Published 12 days ago • 20
DreamWorld: Unified World Modeling in Video Generation Paper • 2603.00466 • Published 11 days ago • 16
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 6 days ago • 15
RealWonder: Real-Time Physical Action-Conditioned Video Generation Paper • 2603.05449 • Published 5 days ago • 10
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images Paper • 2603.02210 • Published 8 days ago • 27
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 12 days ago • 37
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 7 days ago • 85