Mind with Eyes: from Language Reasoning to Multimodal Reasoning • Paper • arXiv:2503.18071 • Published 29 days ago
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding • Paper • arXiv:2503.12797 • Published Mar 17