- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
  Paper • 2503.12797 • Published • 29
- MaxyLee/DeepPerception
  Image-Text-to-Text • Updated • 26 • 2
- MaxyLee/KVG-Bench
  Viewer • Updated • 1.34k • 54
- MaxyLee/DeepPerception-FGVR
  Image-Text-to-Text • Updated • 26
Xinyu Ma
MaxyLee
AI & ML interests
None yet
Recent Activity
upvoted a collection 8 days ago
Qwen2.5-VL
new activity 16 days ago
MaxyLee/KVG:Update dataset card: add link to paper, change task category
Organizations
None yet