view article Article MotionLCM-V2: Improved Compression Rate for Multi-Latent-Token Diffusion By wxDai β’ 15 days ago β’ 12
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video Paper β’ 2411.18671 β’ Published 28 days ago β’ 20
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Paper β’ 2411.14347 β’ Published Nov 21 β’ 13 β’ 2
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Paper β’ 2410.18977 β’ Published Oct 24 β’ 14
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Paper β’ 2410.18977 β’ Published Oct 24 β’ 14
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Paper β’ 2410.18977 β’ Published Oct 24 β’ 14 β’ 2