Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers Paper • 2412.00142 • Published Nov 28, 2024 • 2
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21, 2024 • 44
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples Paper • 2410.14669 • Published Oct 18, 2024 • 36
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21, 2024 • 44
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples Paper • 2410.14669 • Published Oct 18, 2024 • 36
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 Image-Text-to-Text • Updated about 2 hours ago • 1.26k • 186