DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control Paper • 2410.13830 • Published Oct 17, 2024 • 24
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model Paper • 2411.19108 • Published Nov 28, 2024 • 17
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection Paper • 2205.09613 • Published May 19, 2022
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution Paper • 2405.16071 • Published May 25, 2024 • 2
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation Paper • 2411.08380 • Published Nov 13, 2024 • 25
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model Paper • 2411.19108 • Published Nov 28, 2024 • 17
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection Paper • 2402.03634 • Published Feb 6, 2024
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution Paper • 2405.16071 • Published May 25, 2024 • 2
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model Paper • 2404.04167 • Published Apr 5, 2024 • 12