Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources Paper • 2504.00595 • Published 10 days ago • 33 • 7
Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources Paper • 2504.00595 • Published 10 days ago • 33 • 7
VILA: On Pre-training for Visual Language Models Paper • 2312.07533 • Published Dec 12, 2023 • 23 • 2