LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent Paper • 2309.12311 • Published Sep 21, 2023 • 17
Gated Slot Attention for Efficient Linear-Time Sequence Modeling Paper • 2409.07146 • Published Sep 11, 2024 • 19
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions Paper • 2406.09264 • Published Jun 13, 2024 • 1
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Paper • 2406.05132 • Published Jun 7, 2024 • 27
Large Language Models Can Be Easily Distracted by Irrelevant Context Paper • 2302.00093 • Published Jan 31, 2023
Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue Paper • 2305.11271 • Published May 18, 2023
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation Paper • 2402.16846 • Published Feb 26, 2024
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation Paper • 2310.13165 • Published Oct 19, 2023
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents Paper • 2210.12511 • Published Oct 22, 2022
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models Paper • 2306.08685 • Published Jun 14, 2023 • 1
DANLI: Deliberative Agent for Following Natural Language Instructions Paper • 2210.12485 • Published Oct 22, 2022
Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models Paper • 2310.19619 • Published Oct 30, 2023