Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots Paper โข 2504.03735 โข Published 17 days ago
Unfair Alignment: Examining Safety Alignment Across Vision Encoder Layers in Vision-Language Models Paper โข 2411.04291 โข Published Nov 6, 2024
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper โข 2503.05132 โข Published Mar 7 โข 55
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper โข 2411.10440 โข Published Nov 15, 2024 โข 124
Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models Paper โข 2307.14539 โข Published Jul 26, 2023 โข 2 โข 1