long-context-mllm - a xing0047 Collection

xing0047 's Collections

SAM

long-context-mllm

long-context-mllm

updated Oct 27

Visual Context Window Extension: A New Perspective for Long Video Understanding

Paper • 2409.20018 • Published Sep 30 • 9
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4 • 55
Long Context Transfer from Language to Vision

Paper • 2406.16852 • Published Jun 24 • 32
lmms-lab/LongVA-7B-DPO

Text Generation • Updated Jun 26 • 845 • 7
lmms-lab/LongVA-7B

Text Generation • Updated Jun 26 • 848 • 15
FreedomIntelligence/LongLLaVA-9B

Image-Text-to-Text • Updated Oct 12 • 640 • 3
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges

Paper • 2409.01071 • Published Sep 2 • 27
Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24 • 17
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22 • 25