Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 16 days ago • 443
Neighboring Autoregressive Modeling for Efficient Visual Generation Paper • 2503.10696 • Published Mar 12 • 8
NAR Collection Neighboring Autoregressive Modeling for Efficient Visual Generation • 10 items • Updated about 1 month ago • 2
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published Oct 11, 2024 • 12