UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics โข 335 items โข Updated about 9 hours ago โข 49
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub โข 3 items โข Updated Nov 10, 2024 โข 10
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated Dec 6, 2024 โข 582
Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated 14 days ago โข 299
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper โข 2409.08264 โข Published Sep 12, 2024 โข 46
LLaVA-Video Collection Models focus on video understanding (previously known as LLaVA-NeXT-Video). โข 8 items โข Updated Feb 21 โข 61
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma โข 16 items โข Updated about 5 hours ago โข 145
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐ Collection Papers about vision-language models, most important ones are on top of the list. โข 27 items โข Updated Apr 30, 2024 โข 36