Mahou Collection flammen.ai's production model for casual conversation and character roleplay • 24 items • Updated Oct 14, 2024 • 4
Personal Favorites Collection Recommended models I use often or like for any reason. I recommend reading their cards for more details. • 10 items • Updated 20 days ago • 62
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference Paper • 2110.03742 • Published Sep 24, 2021 • 4
Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts Paper • 2210.03885 • Published Oct 8, 2022 • 1
RPMax v1 Models Collection RPMax series of models with higher creativity and reduced repetition for "classic" RP chats. • 16 items • Updated Dec 6, 2024 • 17
EVA Gen 0.0 Collection RP/creative writing specialist models, trained on a curated mixture of natural and synthetic data. • 6 items • Updated 16 days ago • 3
Recommended large models Collection This collection contains some of the recent models larger than ~25B parameters that should be high quality and reliable • 15 items • Updated Nov 27, 2024 • 11
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76
Daily Driver's/ Current Favorite's Collection Smart, great at rp. What more do i say? • 2 items • Updated Nov 4, 2024 • 11
Article 🚨 ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming By sted97 • Jun 25, 2024 • 5
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published 26 days ago • 49
Accelerated Preference Optimization for Large Language Model Alignment Paper • 2410.06293 • Published Oct 8, 2024 • 5
TrustLLM: Trustworthiness in Large Language Models Paper • 2401.05561 • Published Jan 10, 2024 • 66
Canonical models Collection This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace • 68 items • Updated Feb 13, 2024 • 14
ELM Collection Collection of various ELM models from "Erasing Conceptual Knowledge from Language Models" • 4 items • Updated Oct 21, 2024 • 2