Xudong Xu's picture

Xudong Xu

Sheldoooon

·

https://sheldontsui.github.io/

SheldonTsui

AI & ML interests

AIGC for Embodied AI

Recent Activity

upvoted a paper about 1 month ago

EgoSim: Egocentric World Simulator for Embodied Interaction Generation

upvoted a paper about 1 month ago

GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation

upvoted a paper 4 months ago

RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

View all activity

Organizations

upvoted 2 papers about 1 month ago

EgoSim: Egocentric World Simulator for Embodied Interaction Generation

Paper • 2604.01001 • Published Apr 1 • 38

GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation

Paper • 2603.26661 • Published Mar 27 • 25

upvoted a paper 4 months ago

RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

Paper • 2601.05241 • Published Jan 8 • 24

upvoted 8 papers 8 months ago

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Paper • 2509.22281 • Published Sep 26, 2025 • 33

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 161

SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent

Paper • 2509.20414 • Published Sep 24, 2025 • 10

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Paper • 2509.21245 • Published Sep 25, 2025 • 40

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation

Paper • 2509.20358 • Published Sep 24, 2025 • 15

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23, 2025 • 31

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 153

liked a dataset 8 months ago

InternRobotics/MesaTask-10K

Updated Sep 29, 2025 • 396 • 15

upvoted a paper 8 months ago

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Paper • 2509.12815 • Published Sep 16, 2025 • 41

liked 2 models 8 months ago

lhjiang/anysplat

Image-to-3D • Updated Sep 17, 2025 • 11k • 11

InternRobotics/F1-VLA

Robotics • 4B • Updated Sep 9, 2025 • 39 • 32

liked 5 datasets 8 months ago

InternRobotics/MotionMillion

Viewer • Updated Nov 17, 2025 • 1.25M • 207 • 46

InternRobotics/InternData-N1

Updated Feb 6 • 43.5k • 71

InternRobotics/InternData-A1

Preview • Updated Mar 27 • 23k • 88

InternRobotics/InternData-M1

Viewer • Updated Dec 12, 2025 • 1.66M • 3.45k • 30

InternRobotics/InternScenes

Viewer • Updated Feb 4 • 3.2M • 9.68k • 36