Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Paper • 2503.24377 • Published 21 days ago • 17
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Paper • 2503.22952 • Published 24 days ago • 18
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Paper • 2503.21694 • Published 25 days ago • 16
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper • 2504.01724 • Published 19 days ago • 63
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper • 2504.01848 • Published 19 days ago • 35
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Paper • 2504.01934 • Published 19 days ago • 22
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published 20 days ago • 82
Articulated Kinematics Distillation from Video Diffusion Models Paper • 2504.01204 • Published 20 days ago • 23