meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 12 days ago • 723k • 806
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 392
MCG-NJU/videomae-base-finetuned-kinetics Video Classification • Updated Mar 29, 2024 • 78.4k • 34
openai/whisper-large-v3 Automatic Speech Recognition • Updated Aug 12, 2024 • 4.84M • 4.29k