On Large Multimodal Models as Open-World Image Classifiers Paper • 2503.21851 • Published 27 days ago • 5
Compositional Caching for Training-free Open-vocabulary Attribute Detection Paper • 2503.19145 • Published 30 days ago
Automatic benchmarking of large multimodal models via iterative experiment programming Paper • 2406.12321 • Published Jun 18, 2024
Vocabulary-free Image Classification and Semantic Segmentation Paper • 2404.10864 • Published Apr 16, 2024
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation Paper • 2308.09139 • Published Aug 17, 2023
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss Paper • 2207.11482 • Published Jul 23, 2022
Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition Paper • 2210.05246 • Published Oct 11, 2022