Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Paper • 2312.03491 • Published Dec 6, 2023 • 33
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling Paper • 2408.03695 • Published Aug 7 • 12
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures Paper • 2401.11078 • Published Jan 20 • 7
CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models Paper • 2311.11567 • Published Nov 20, 2023 • 8
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond Paper • 2304.04968 • Published Apr 11, 2023
Exploiting Chain Rule and Bayes' Theorem to Compare Probability Distributions Paper • 2012.14100 • Published Dec 28, 2020
Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs Paper • 2202.06510 • Published Feb 14, 2022
Contrastive Attraction and Contrastive Repulsion for Representation Learning Paper • 2105.03746 • Published May 8, 2021 • 1
DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration Paper • 2303.06885 • Published Mar 13, 2023