One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin PRO
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
authored a paper 3 days ago
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary
liked a dataset 3 days ago
lmms-lab/AISG_Challenge
commented on a paper 3 days ago
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary