ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 6 days ago • 90
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 7 days ago • 16
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 6 days ago • 70
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119
🔱 Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 9 items • Updated 22 days ago • 21
Quantization Spaces on the Hub ⚡ Collection A collection of spaces that allow you to quantize on the Hub • 4 items • Updated Nov 4 • 5
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 27 days ago • 29
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14 • 17
Qwen 2.5 Coder Collection Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated about 8 hours ago • 20
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 27 days ago • 257