Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models • Paper • 2503.20240 • Published 22 days ago • 21
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy • Paper • 2503.19757 • Published 22 days ago • 50
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models • Paper • 2503.18886 • Published 23 days ago • 20
Position: Interactive Generative Video as Next-Generation Game Engine • Paper • 2503.17359 • Published 26 days ago • 61
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders • Paper • 2503.18878 • Published 23 days ago • 114
Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation • Paper • 2503.13070 • Published about 1 month ago • 9
Concat-ID: Towards Universal Identity-Preserving Video Synthesis • Paper • 2503.14151 • Published 30 days ago • 10
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control • Paper • 2503.14492 • Published 29 days ago • 17
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer • Paper • 2503.07027 • Published Mar 10 • 28