SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 3 days ago • 195
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated 29 days ago • 99
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 148
Emu3 Collection Emu3: Next-Token Prediction is All You Need • 5 items • Updated 4 days ago • 66
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289
xLAM models Collection xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 6 days ago • 44
GLiNER bi-encoders Collection Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10 • 13
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 3 days ago • 204
CommonCatalog Collection Common Catalog, a dataset with Creative Commons licensed images and machine-generated caption pairs • 8 items • Updated May 16 • 14
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval • Mar 22 • 69
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 89
Awesome Document AI Collection A collection of open-source document AI • 27 items • Updated Mar 11 • 75
Vector-io compatible Datasets Collection These datasets can be loaded into your vector database with a single line bash command • 15 items • Updated Sep 19 • 3
Pokemons dataset captioned with different models Collection The Pokemons dataset from Lambda Labs is quite popular in the diffusion community because it lets us quickly validate ideas. • 3 items • Updated Nov 28, 2023 • 3
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 115