Yinxu Pan's picture

Yinxu Pan

cppowboy

·

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper about 1 hour ago

A Survey of Reinforcement Learning for Large Reasoning Models

upvoted a paper about 1 hour ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

upvoted a paper 2 days ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

View all activity

Organizations

cppowboy 's models 2

cppowboy/XAgentLLaMa-7B-preview

Text Generation • Updated Nov 21, 2023 • 13

cppowboy/XAgentLLaMa-34B-preview

Updated Nov 20, 2023