Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model 6 days ago
PRIME-RL/Eurus-2-7B-PRIME
updated a model 6 days ago
PRIME-RL/Eurus-2-7B-SFT
updated a dataset 7 days ago
PRIME-RL/Eurus-2-RL-Data

Articles

Organizations

OpenBMB, PRIME

hanbin's activity

New activity in PRIME-RL/Eurus-2-7B-PRIME 9 days ago

Evaluation

6
#1 opened 10 days ago by tugstugi
upvoted an article 10 days ago

Process Reinforcement through Implicit Rewards

By ganqu
15