arxiv:2501.05452
Xingyu Fu
Fiaa
AI & ML interests
NLP, multimodal
Recent Activity
liked
a dataset
12 days ago
deepcs233/Visual-CoT
liked
a model
18 days ago
stabilityai/stable-video-diffusion-img2vid-xt
authored
a paper
22 days ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding