M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Models (51)
JunxiongWang/M1-3B
Text Generation • 3B • Updated • 5 • 2
JunxiongWang/M1-3B-SFT
Text Generation • 3B • Updated • 6 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH
1B • Updated • 3
JunxiongWang/MambaInLlama3B_SFT_MATH
3B • Updated • 5
JunxiongWang/MambaInLlama3B_DPO2
3B • Updated • 5
JunxiongWang/MambaInLlama3B_DPO1
3B • Updated • 2
JunxiongWang/MambaInLlama3B_Distill_MATH
3B • Updated • 3
JunxiongWang/MambaInLlama3B_v3
3B • Updated • 4
JunxiongWang/MambaInLlama1B_Distill_MATH
1B • Updated • 2
JunxiongWang/mamba_0_5_distill
Updated • 5
Datasets (20)
JunxiongWang/QwenFineMATH
Viewer • Updated • 6.71M • 310
JunxiongWang/R1_GR_SFT
Viewer • Updated • 44k • 33
JunxiongWang/R1_SFT
Updated • 151
JunxiongWang/R1_Sythetic_SFT
Viewer • Updated • 1M • 711
JunxiongWang/MATH_SFT
Viewer • Updated • 19.1M • 271
JunxiongWang/R1_OpenThoughts_SFT
Viewer • Updated • 862k • 277
JunxiongWang/R1_am_SFT
Viewer • Updated • 1.4M • 525
JunxiongWang/qwen1b_it_math
Viewer • Updated • 19.1M • 51
JunxiongWang/test_math
Viewer • Updated • 89.1k • 53
JunxiongWang/FineMathV4
Viewer • Updated • 6.7M • 75