AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation Paper • 2603.28068 • Published 5 days ago • 9
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published 3 days ago • 39
Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning Paper • 2604.02007 • Published 3 days ago • 5
LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model Paper • 2604.02097 • Published 3 days ago • 27
GPA: Learning GUI Process Automation from Demonstrations Paper • 2604.01676 • Published 3 days ago • 9
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Paper • 2604.01007 • Published 3 days ago • 19
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 9 days ago • 153
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 3 days ago • 81
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 3 days ago • 118
TrajectoryMover: Generative Movement of Object Trajectories in Videos Paper • 2603.29092 • Published 5 days ago • 3
OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation Paper • 2603.30045 • Published 5 days ago • 4
TokenDial: Continuous Attribute Control in Text-to-Video via Spatiotemporal Token Offsets Paper • 2603.27520 • Published 7 days ago • 5
Meta-Harness: End-to-End Optimization of Model Harnesses Paper • 2603.28052 • Published 6 days ago • 12
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models Paper • 2604.00479 • Published 4 days ago • 21
Dynin-Omni: Omnimodal Unified Large Diffusion Language Model Paper • 2604.00007 • Published 27 days ago • 18
UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems Paper • 2604.00590 • Published 4 days ago • 7
Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference Paper • 2603.29002 • Published 6 days ago • 5
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding Paper • 2604.00528 • Published 4 days ago • 7