PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception Paper • 2606.28322 • Published 7 days ago • 26
Perceive-to-Reason: Decoupling Perception and Reasoning for Fine-Grained Visual Reasoning Paper • 2607.01191 • Published 1 day ago • 10
Pause or Fabricate? Training Language Models for Grounded Reasoning Paper • 2604.19656 • Published Apr 21 • 10
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments Paper • 2604.14144 • Published Apr 15 • 63
UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding Paper • 2604.14113 • Published Apr 15 • 10
UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization Paper • 2604.13822 • Published Apr 15 • 7
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143
ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models Paper • 2505.21500 • Published May 27, 2025 • 13