Mind with Eyes: from Language Reasoning to Multimodal Reasoning Paper • 2503.18071 • Published about 1 month ago • 3
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Paper • 2503.12797 • Published Mar 17 • 30