arxiv:2601.21590
xiaotong
xtongji
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving
authored a paper 3 days ago
Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective
authored a paper 3 days ago
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
Organizations
None yet