No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published 16 days ago • 41
Visual Context Window Extension: A New Perspective for Long Video Understanding Paper • 2409.20018 • Published Sep 30, 2024 • 9
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding Paper • 2409.18111 • Published Sep 26, 2024 • 6
Visual Context Window Extension: A New Perspective for Long Video Understanding Paper • 2409.20018 • Published Sep 30, 2024 • 9 • 2
Improving Generalization of Image Captioning with Unsupervised Prompt Learning Paper • 2308.02862 • Published Aug 5, 2023
Visual Context Window Extension: A New Perspective for Long Video Understanding Paper • 2409.20018 • Published Sep 30, 2024 • 9
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Paper • 2406.04325 • Published Jun 6, 2024 • 72
DeepSeek-VL: Towards Real-World Vision-Language Understanding Paper • 2403.05525 • Published Mar 8, 2024 • 40