RobustDexGrasp: Robust Dexterous Grasping of General Objects from Single-view Perception Paper • 2504.05287 • Published 4 days ago • 3
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Paper • 2504.03886 • Published 7 days ago • 6
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding Paper • 2504.06719 • Published 3 days ago • 7
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography Paper • 2504.07083 • Published 2 days ago • 19
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 3 days ago • 125
One-Minute Video Generation with Test-Time Training Paper • 2504.05298 • Published 4 days ago • 88
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step Paper • 2504.01956 • Published 9 days ago • 36
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published 10 days ago • 57
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Paper • 2504.01016 • Published 10 days ago • 28
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 11 days ago • 72
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published 12 days ago • 90
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation Paper • 2503.14941 • Published 24 days ago • 6
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Paper • 2503.21694 • Published 15 days ago • 15
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Paper • 2503.20308 • Published 17 days ago • 22
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model Paper • 2503.22622 • Published 14 days ago • 17
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image Paper • 2503.17358 • Published 21 days ago • 6
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 17 days ago • 47
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models Paper • 2503.12885 • Published 26 days ago • 43
CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts Paper • 2503.11958 • Published 28 days ago • 3