OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper โข 2407.02371 โข Published Jul 2 โข 51
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling Paper โข 2411.18664 โข Published 29 days ago โข 23
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning Paper โข 2410.09754 โข Published Oct 13 โข 7
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Paper โข 2404.02905 โข Published Apr 3 โข 65
Diffusion Model Alignment Using Direct Preference Optimization Paper โข 2311.12908 โข Published Nov 21, 2023 โข 47
iColoriT: Towards Propagating Local Hint to the Right Region in Interactive Colorization by Leveraging Vision Transformer Paper โข 2207.06831 โข Published Jul 14, 2022
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis Paper โข 2308.08157 โข Published Aug 16, 2023 โข 2
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis Paper โข 2308.08157 โข Published Aug 16, 2023 โข 2