Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
24
59
230
Yinxu Pan
cppowboy
Follow
thomwolf's profile picture
0xSojalSec's profile picture
Mercury7353's profile picture
14 followers
·
62 following
https://github.com/Cppowboy
pnynx3
Cppowboy
AI & ML interests
RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining
Recent Activity
upvoted
a
paper
about 1 hour ago
A Survey of Reinforcement Learning for Large Reasoning Models
upvoted
a
paper
about 1 hour ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
upvoted
a
paper
2 days ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
View all activity
Organizations
cppowboy
's models
2
Sort: Recently updated
cppowboy/XAgentLLaMa-7B-preview
Text Generation
•
Updated
Nov 21, 2023
•
13
cppowboy/XAgentLLaMa-34B-preview
Updated
Nov 20, 2023