unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit Image-to-Text • 112B • Updated Apr 12, 2025 • 978 • 80
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 173k • • 1.56k
stabilityai/stable-video-diffusion-img2vid-xt-1-1 Image-to-Video • Updated Jul 10, 2024 • 7.98k • 975
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 Text Generation • 8B • Updated Aug 7, 2024 • 465k • 86
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 Text Generation • 71B • Updated Aug 7, 2024 • 768 • 23
Running 194 Vidore Leaderboard 🥇 194 Compare and rank visual document retrieval models across different benchmarks