A Recipe for Scaling up Text-to-Video Generation with Text-free Videos Paper • 2312.15770 • Published Dec 25, 2023 • 12
HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Paper • 2312.15980 • Published Dec 26, 2023 • 10
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers Paper • 2312.12468 • Published Dec 19, 2023 • 10
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors Paper • 2312.13324 • Published Dec 20, 2023 • 9
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Paper • 2312.13763 • Published Dec 21, 2023 • 9
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis Paper • 2312.13834 • Published Dec 20, 2023 • 26
VideoPoet: A Large Language Model for Zero-Shot Video Generation Paper • 2312.14125 • Published Dec 21, 2023 • 44
VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams Paper • 2312.01407 • Published Dec 3, 2023 • 6
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis Paper • 2311.12454 • Published Nov 21, 2023 • 30
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Paper • 2311.10093 • Published Nov 16, 2023 • 56
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning Paper • 2311.10709 • Published Nov 17, 2023 • 24
MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer Paper • 2311.12052 • Published Nov 18, 2023 • 31
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models Paper • 2312.00845 • Published Dec 1, 2023 • 36
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation Paper • 2310.10769 • Published Oct 16, 2023 • 8