20 10 5

Hanbin Wang

hanbin

https://wanghanbinpanda.github.io/

wanghanbinpanda

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

upvoted a paper 25 days ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

upvoted a paper about 1 month ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

liked a model 3 months ago

openbmb/MiniCPM-o-4_5

View all activity

Organizations

upvoted a paper 25 days ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 26 days ago • 84

upvoted a paper about 1 month ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 66

liked a model 3 months ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated 6 days ago • 130k • 1.37k

upvoted 3 papers 8 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29, 2025 • 22

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127

published a dataset 9 months ago

hanbin/dolmino-mix-1124-pes2o-hf-3

Updated Aug 25, 2025 • 2

updated a dataset 9 months ago

hanbin/dolmino-mix-1124-pes2o-hf

Viewer • Updated Aug 25, 2025 • 1.09M • 2

published a dataset 9 months ago

hanbin/dolmino-mix-1124-pes2o-hf

Viewer • Updated Aug 25, 2025 • 1.09M • 2

updated 2 models 10 months ago

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

published 2 models 10 months ago

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

updated 2 models 10 months ago

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B

Text Generation • 8B • Updated Jul 28, 2025 • 3

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B

Text Generation • 8B • Updated Jul 28, 2025 • 6

published 2 models 10 months ago

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B

Text Generation • 8B • Updated Jul 28, 2025 • 6

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B

Text Generation • 8B • Updated Jul 28, 2025 • 3

updated a model 10 months ago

hanbin/Qwen2.5-7B-pattern-mixed-6epoch

Text Generation • 8B • Updated Jul 23, 2025 • 2

published a model 10 months ago

hanbin/Qwen2.5-7B-pattern-mixed-6epoch

Text Generation • 8B • Updated Jul 23, 2025 • 2

updated a model 10 months ago

hanbin/Llama-3.1-8B-pretrain-1

Text Generation • 8B • Updated Jul 14, 2025 • 5

Hanbin Wang

AI & ML interests

Recent Activity

Organizations

hanbin's activity