Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 11 items β’ Updated 7 days ago β’ 436
π«StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. β’ 2 items β’ Updated 18 days ago β’ 90
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! β’ 9 items β’ Updated Feb 26 β’ 67
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 32 items β’ Updated 5 days ago β’ 146
BhasaAnuvaad Collection A Speech Translation Dataset for 13 Indian Languages β’ 11 items β’ Updated Jan 16 β’ 16
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 16 items β’ Updated Feb 20 β’ 251
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 β’ 229
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper β’ 2410.02073 β’ Published Oct 2, 2024 β’ 41
Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated 25 days ago β’ 300
π―DART-Math Collection Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving [NeurIPS 2024] @ https://github.com/hkust-nlp/dart-math β’ 20 items β’ Updated Feb 19 β’ 7
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 β’ 15 items β’ Updated Dec 6, 2024 β’ 586