microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated 13 days ago β’ 618k β’ 1.31k
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated Mar 13 β’ 96
LLaVa-NeXT-Video Collection LLaVa-NeXT-Video extends LLaVa-NeXT for video understanding. β’ 5 items β’ Updated Jun 10, 2024 β’ 9
Running 542 542 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects