Mono-InternVL - a OpenGVLab Collection

OpenGVLab 's Collections

VideoChat-Flash

InternVL2.5-MPO

V2PE

InternVL Adaptation

All-Seeing Project

PVT v2

Mono-InternVL

updated Jan 10

A Pioneering Monolithic MLLM

OpenGVLab/Mono-InternVL-2B

Image-Text-to-Text • Updated Nov 21, 2024 • 7.32k • 32
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 4