MegaMath: Pushing the Limits of Open Math Corpora Paper β’ 2504.02807 β’ Published 12 days ago β’ 29
kaitchup/DeepSeek-R1-Distill-Qwen-14B-AutoRound-GPTQ-4bit Text Generation β’ Updated Jan 27 β’ 264 β’ 6
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond β’ 7 items β’ Updated Mar 13 β’ 11
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper β’ 2502.14922 β’ Published Feb 19 β’ 31
TransMLA: Multi-head Latent Attention Is All You Need Paper β’ 2502.07864 β’ Published Feb 11 β’ 49
FuseO1-Preview Collection System-II Reasoning Fusion of LLMs β’ 11 items β’ Updated 7 days ago β’ 22