EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion Paper • 2501.13452 • Published 1 day ago • 4
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Paper • 2501.13926 • Published 1 day ago • 11
Multi-subject Open-set Personalization in Video Generation Paper • 2501.06187 • Published 14 days ago • 13
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Paper • 2501.04698 • Published 16 days ago • 15
Ingredients: Blending Custom Photos with Video Diffusion Transformers Paper • 2501.01790 • Published 22 days ago • 8
VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models Paper • 2412.19645 • Published 28 days ago • 13
Mind the Time: Temporally-Controlled Multi-Event Video Generation Paper • 2412.05263 • Published Dec 6, 2024 • 10
🎬 Video models Collection text-to-video & image-to-video models released by the Chinese community • 22 items • Updated Dec 24, 2024 • 4
Trending Papers - November ✨ Collection Most upvoted papers on the Daily Papers • 10 items • Updated Dec 24, 2024 • 3
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle Paper • 2407.19548 • Published Jul 28, 2024 • 25
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis Paper • 2409.02048 • Published Sep 3, 2024 • 3
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published Dec 3, 2024 • 59
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published Nov 28, 2024 • 33
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model Paper • 2411.17459 • Published Nov 26, 2024 • 10
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 233
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published Nov 26, 2024 • 35
ConsisID Collection Identity-Preserving Text-to-Video Generation by Frequency Decomposition • 4 items • Updated Dec 3, 2024 • 11
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model Paper • 2409.01199 • Published Sep 2, 2024 • 14
MagicTime Collection MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators • 4 items • Updated Nov 29, 2024 • 13