PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 8 days ago • 91
SANA-Sprint Collection SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated 10 days ago • 34
Small Models Struggle to Learn from Strong Reasoners Paper • 2502.12143 • Published Feb 17 • 34
Pantheon Series Collection Roleplay models with strong personalities. • 4 items • Updated 28 days ago • 4
Llama Nemotron Collection Open, Production-ready Enterprise Models • 4 items • Updated 1 day ago • 36
GGUF Quants - Roleplay Deployment Models Collection GGUF - Models I think are cool and worth using. A lot of what I make is intended only as a part in a further merge or as a test, these are the others. • 14 items • Updated Sep 3, 2024 • 4