Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking β’ 6 items β’ Updated 10 days ago β’ 61
Sky-T1-7B Collection A series of 7B models trained with different recipes and the corresponding training data. β’ 8 items β’ Updated Feb 14 β’ 6
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond β’ 7 items β’ Updated Mar 13 β’ 11
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper β’ 2412.04862 β’ Published Dec 6, 2024 β’ 51
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated Dec 6, 2024 β’ 662
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated 22 days ago β’ 223
GAVEL: Generating Games Via Evolution and Language Models Paper β’ 2407.09388 β’ Published Jul 12, 2024 β’ 17