Text-to-Image Base Models Collection All text-to-image open source base models, with their respective license • 28 items • Updated May 10, 2024 • 22
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 126
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 69
Vision Language Models Papers Collection Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 35
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 5 days ago • 546
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 702
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 329
Top 10% instruction tuning datasets Collection Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes • 13 items • Updated Jul 3, 2024 • 7