OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper • 2407.02371 • Published Jul 2 • 51
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling Paper • 2411.18664 • Published 29 days ago • 23
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning Paper • 2410.09754 • Published Oct 13 • 7
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Paper • 2404.02905 • Published Apr 3 • 65
Diffusion Model Alignment Using Direct Preference Optimization Paper • 2311.12908 • Published Nov 21, 2023 • 47
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis Paper • 2308.08157 • Published Aug 16, 2023 • 2
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping Paper • 2306.05544 • Published Jun 8, 2023 • 10