Paloma Collection Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated Dec 23, 2025 • 16
view article Article We Got Claude to Build CUDA Kernels and teach open models! +2 about 23 hours ago • 26
Higher Precision GGUFs / Imatrix Plus Collection Models compressed in higher precision with parts of the model compression remaining in F16/full precision. Increases overall quality in all tasks. • 53 items • Updated Dec 9, 2025 • 9
Long Context - 16k,32k,64k,128k,200k,256k,512k,1000k Collection Listed oldest to newest. Some with up to 1 million context. • 62 items • Updated 1 day ago • 22
Instruct Models - Better instruction following. Collection Q6s Instruct models. Other Qs in model listing. Instruct models are better at one shot / api type usage in most cases vs "chat". Listed: old to new. • 91 items • Updated 1 day ago • 6
Reasoning Source files for GGUF, EXL2, AWQ, GPTQ Collection Reasoning model safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card. • 115 items • Updated 12 days ago • 4
Experiments in Merging Top Models Collection Full precision and Q8 of merged models by me. Focus in this collection is creative, role play, story and fiction. Imatrix GGUFs addition(s) pending. • 41 items • Updated Dec 9, 2025 • 6
Older Models - High Quality / Hard to Find Collection Q6 quants of LLM Models. Other Quants - including Q8 - in models listings. • 72 items • Updated Dec 9, 2025 • 11
Brainstorm Adapter Models - Augmented/Expanded Reasoning Collection Adapters by DavidAU: Splits apart the reasoning center(s) and multiples them 3x, 4x, 8x, 10x, 20x, 40x+. Creativity+ / Logic+ / Detail+ / Prose+ ... • 207 items • Updated 5 days ago • 28