arxiv:2412.05271
Zhaoyang Liu
zyliu
AI & ML interests
Video understanding, 3D Perception, Autonomous driving, Foundation models, AIGC
Recent Activity
authored
a paper
8 days ago
InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots
Beyond Language
authored
a paper
8 days ago
Learning Human Motion Representations: A Unified Perspective
authored
a paper
8 days ago
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model
for Hundreds of Vision-Language Tasks
Organizations
spaces
2
models
11
datasets
None public yet