Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 5 days ago • 105
Running 2.47k 2.47k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
ReflectiVA Collection Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 2 items • Updated 4 days ago
ReflectiVA Collection Models and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 3 items • Updated 11 days ago
ReflectiVA Collection Models and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025] • 3 items • Updated 11 days ago
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Paper • 2503.15621 • Published 28 days ago
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering Paper • 2411.16863 • Published Nov 25, 2024
ELSA EU Project Collection Dataset and models created inside the ELSA – European Lighthouse on Secure and Safe AI project on Multimedia use case. • 4 items • Updated Nov 25, 2024
LLaVA-MORE Collection LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 • 2 items • Updated Aug 31, 2024