Submitted by akhaliq 20 mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration · 10 authors 2
Submitted by akhaliq 18 TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models · 4 authors 5
Submitted by akhaliq 13 Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation · 10 authors
Submitted by akhaliq 9 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features · 4 authors
Submitted by akhaliq 7 GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs · 5 authors