System-2 Mathematical Reasoning via Enriched Instruction Tuning Paper • 2412.16964 • Published Dec 22, 2024 • 2
WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents Paper • 2504.15785 • Published Apr 22, 2025 • 22
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization Paper • 2512.02631 • Published Dec 2, 2025 • 9
GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training Paper • 2512.13043 • Published Dec 15, 2025 • 6
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published 3 days ago • 38
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Paper • 2503.08525 • Published Mar 11, 2025 • 17
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published Feb 2, 2025 • 24
Continual Task Allocation in Meta-Policy Network via Sparse Prompting Paper • 2305.18444 • Published May 29, 2023 • 1
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld Paper • 2311.16714 • Published Nov 28, 2023 • 1
PyPop7: A Pure-Python Library for Population-Based Black-Box Optimization Paper • 2212.05652 • Published Dec 12, 2022 • 2
WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents Paper • 2410.07484 • Published Oct 9, 2024 • 51