On Large Multimodal Models as Open-World Image Classifiers Paper • 2503.21851 • Published 9 days ago • 4
Compositional Caching for Training-free Open-vocabulary Attribute Detection Paper • 2503.19145 • Published 12 days ago
Automatic benchmarking of large multimodal models via iterative experiment programming Paper • 2406.12321 • Published Jun 18, 2024
Vocabulary-free Image Classification and Semantic Segmentation Paper • 2404.10864 • Published Apr 16, 2024
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation Paper • 2308.09139 • Published Aug 17, 2023