23 28 22

Qinghong (Kevin) Lin PRO

KevinQHLin

http://qhlin.me/

AI & ML interests

Vision-Language Model, Video Understanding, Human-AI Interaction

Recent Activity

upvoted a paper 11 days ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

liked a model 19 days ago

yeliudev/VideoMind-2B

liked a dataset 19 days ago

yeliudev/VideoMind-Dataset

View all activity

Organizations

KevinQHLin's activity

upvoted a paper 11 days ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published 14 days ago • 12

liked a model 19 days ago

yeliudev/VideoMind-2B

Video-Text-to-Text • Updated 17 days ago • 202 • 1

liked a dataset 19 days ago

yeliudev/VideoMind-Dataset

Preview • Updated 18 days ago • 4.81k • 2

upvoted a collection 23 days ago

VideoMind

Collection

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning • 8 items • Updated 22 days ago • 3

liked a Space 23 days ago

VideoMind 2B

💡

A Chain-of-LoRA Agent for Long Video Reasoning

updated a dataset 24 days ago

KevinQHLin/Videodata

Viewer • Updated 24 days ago • 581 • 188

published a dataset 24 days ago

KevinQHLin/Videodata

Viewer • Updated 24 days ago • 581 • 188

upvoted 2 papers 27 days ago

Edit Transfer: Learning Image Editing via Vision In-Context Relations

Paper • 2503.13327 • Published Mar 17 • 29

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published 28 days ago • 72

authored a paper about 1 month ago

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Paper • 2503.13444 • Published Mar 17 • 15

upvoted a paper about 1 month ago

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Paper • 2503.13444 • Published Mar 17 • 15

commented a paper about 1 month ago

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Paper • 2503.13444 • Published Mar 17 • 15 •

authored a paper about 1 month ago

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Paper • 2503.09402 • Published Mar 12 • 6

liked a dataset about 1 month ago

lmms-lab/AISG_Challenge

Viewer • Updated Mar 11 • 1.5k • 1.68k • 6

commented a paper about 1 month ago

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Paper • 2503.09402 • Published Mar 12 • 6 •

upvoted a paper about 1 month ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published Mar 12 • 44

updated a model about 1 month ago

KevinQHLin/VLog

Updated Mar 12

published a model about 1 month ago

KevinQHLin/VLog

Updated Mar 12

upvoted a paper about 1 month ago

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published Mar 10 • 43

updated a model about 1 month ago

showlab/ShowUI-2B

Updated Mar 11 • 12.9k • 249