Yixiao Ge
yxgeee
AI & ML interests
Computer Vision, Foundation Models
Recent Activity
authored a paper 15 days ago
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation
authored a paper 16 days ago
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation