ReT Collection Models and data for ReT: Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval [CVPR 2025] • 6 items • Updated 6 days ago
ReflectiVA Collection Models and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 2 items • Updated 9 days ago
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Paper • 2503.15621 • Published 15 days ago