TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published 21 days ago • 93
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 25 days ago • 49
Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published 24 days ago • 76
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Paper • 2503.21758 • Published 24 days ago • 20
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis Paper • 2503.21749 • Published 24 days ago • 26
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting Paper • 2503.08677 • Published Mar 11 • 28
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published Mar 11 • 31
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published Mar 10 • 35
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization Paper • 2503.08619 • Published Mar 11 • 20