OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 2 days ago • 110
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step Paper • 2504.01956 • Published 8 days ago • 36
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published 9 days ago • 57
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Paper • 2504.01016 • Published 9 days ago • 28
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 10 days ago • 72
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published 11 days ago • 90
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation Paper • 2503.14941 • Published 22 days ago • 6
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Paper • 2503.21694 • Published 14 days ago • 15
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Paper • 2503.20308 • Published 15 days ago • 22
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model Paper • 2503.22622 • Published 13 days ago • 17
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image Paper • 2503.17358 • Published 20 days ago • 6
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 15 days ago • 47
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models Paper • 2503.12885 • Published 24 days ago • 43
CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts Paper • 2503.11958 • Published 27 days ago • 3
TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing Paper • 2503.11629 • Published 27 days ago • 6
FlowTok: Flowing Seamlessly Across Text and Image Tokens Paper • 2503.10772 • Published 28 days ago • 18
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published 27 days ago • 132
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity Paper • 2503.07677 • Published Mar 10 • 82
Learning a Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with Occupancy Aids Scene Representation Paper • 2303.09152 • Published Mar 16, 2023 • 1