TULIP: Towards Unified Language-Image Pretraining • Paper • 2503.15485 • Published 20 days ago • 44
We Can't Understand AI Using our Existing Vocabulary • Paper • 2502.07586 • Published Feb 11 • 10
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks • Paper • 2502.08235 • Published Feb 12 • 56
VisionArena: 230K Real World User-VLM Conversations with Preference Labels • Paper • 2412.08687 • Published Dec 11, 2024 • 13
Stylus: Automatic Adapter Selection for Diffusion Models • Paper • 2404.18928 • Published Apr 29, 2024 • 15
RAFT: Adapting Language Model to Domain Specific RAG • Paper • 2403.10131 • Published Mar 15, 2024 • 73
VideoAgent: Long-form Video Understanding with Large Language Model as Agent • Paper • 2403.10517 • Published Mar 15, 2024 • 36
Describing Differences in Image Sets with Natural Language • Paper • 2312.02974 • Published Dec 5, 2023 • 16