arxiv:2501.05122
Flo Schneider
floschne
AI & ML interests
Large Vision-Language Models, Cross-modal Retrieval
Recent Activity
upvoted
a
collection
7 days ago
Qwen2.5-VL
updated
a dataset
8 days ago
floschne/gimmick-vvqa
published
a dataset
16 days ago
floschne/gimmick-vvqa
Organizations
models
None public yet
datasets
15
floschne/gimmick-vvqa
Updated
•
139
floschne/wismir3
Viewer
•
Updated
•
301k
•
171
floschne/xflickrco_1k
Viewer
•
Updated
•
8k
•
53
•
1
floschne/xflickrco
Viewer
•
Updated
•
16k
•
66
•
1
floschne/xgqa_1k
Viewer
•
Updated
•
8k
•
63
floschne/xvnli
Viewer
•
Updated
•
5.82k
•
47
floschne/xgqa
Viewer
•
Updated
•
77.3k
•
116
floschne/xm3600_1k
Updated
•
108
floschne/xm3600
Updated
•
52
•
5
floschne/m5b_vlod
Viewer
•
Updated
•
1.42k
•
29