UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published 14 days ago • 48
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published 21 days ago • 272
The Lessons of Developing Process Reward Models in Mathematical Reasoning • Paper • 2501.07301 • Published 22 days ago • 89
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models • Paper • 2501.03262 • Published Jan 4 • 90