Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated 20 days ago • 27
L3-8B-Helium3 Collection The culmination of my first LLM project. Hybrid storytelling and RP model, with a focus on niche fetish content. (This will be a recurring theme.) • 3 items • Updated Sep 16 • 1
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated Oct 1 • 26
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 7 days ago • 180
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14 • 536
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 19 days ago • 697