Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Paper • 2503.16257 • Published 16 days ago • 23
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published 16 days ago • 65
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7 • 132
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published Feb 4 • 64
Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published Jan 2 • 21
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems Paper • 2407.01370 • Published Jul 1, 2024 • 88
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild Paper • 2305.11147 • Published May 18, 2023 • 3