Mimir: Improving Video Diffusion Models for Precise Text Understanding Paper • 2412.03085 • Published 22 days ago • 12
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control Paper • 2410.13830 • Published Oct 17 • 23
FlashFace: Human Image Personalization with High-fidelity Identity Preservation Paper • 2403.17008 • Published Mar 25 • 19
ViM: Vision Middleware for Unified Downstream Transferring Paper • 2303.06911 • Published Mar 13, 2023
LivePhoto: Real Image Animation with Text-guided Motion Control Paper • 2312.02928 • Published Dec 5, 2023 • 16
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Paper • 2311.17002 • Published Nov 28, 2023 • 5
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation Paper • 2311.15773 • Published Nov 27, 2023 • 4
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation Paper • 2311.15841 • Published Nov 27, 2023 • 2
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning Paper • 2303.15230 • Published Mar 27, 2023
Rethinking Supervised Pre-training for Better Downstream Transferring Paper • 2110.06014 • Published Oct 12, 2021
UKnow: A Unified Knowledge Protocol for Common-Sense Reasoning and Vision-Language Pre-training Paper • 2302.06891 • Published Feb 14, 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Paper • 2311.17002 • Published Nov 28, 2023 • 5 • 4
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Paper • 2311.17002 • Published Nov 28, 2023 • 5
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Paper • 2311.17002 • Published Nov 28, 2023 • 5 • 4
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Paper • 2311.17002 • Published Nov 28, 2023 • 5 • 4