Visual Question Decomposition on Multimodal Large Language Models Paper • 2409.19339 • Published 13 days ago • 7
Cottention: Linear Transformers With Cosine Attention Paper • 2409.18747 • Published 14 days ago • 15
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding Paper • 2409.06210 • Published Sep 10 • 24