1 51 143

Zhaocheng Liu

zhaocheng

https://scholar.google.com/citations?user=Kk-dRIAAAAAJ

AI & ML interests

None yet

Recent Activity

commented on a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

upvoted a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

upvoted a paper 22 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

View all activity

Organizations

zhaocheng's activity

commented a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 4 days ago • 76 •

upvoted a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 4 days ago • 76

upvoted a paper 22 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 25 days ago • 43

liked a dataset about 1 month ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated 4 days ago • 1.79M • 4.57k • 64

upvoted a paper about 1 month ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7 • 26

liked a model about 2 months ago

BadToBest/EchoMimicV2

Updated Jan 6 • 116

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated Feb 24 • 1.7M • • 1.17k

liked a dataset 2 months ago

deepmind/code_contests

Viewer • Updated Jun 11, 2023 • 4.04k • 14k • 164

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated Feb 24 • 1.08M • 616

liked a dataset 2 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 3.12k • 107

liked a model 2 months ago

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • Updated Jan 29 • 2.57M • 314

upvoted 3 papers 3 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 60

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 213

liked a model 3 months ago

Qwen/Qwen2.5-Math-7B

Text Generation • Updated Sep 23, 2024 • 133k • 82

upvoted a paper 3 months ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 42

updated a model 3 months ago

zhaocheng/patient_simulator

Updated Jan 23 • 1

liked a model 3 months ago

zhaocheng/patient_simulator

Updated Jan 23 • 1

published a model 3 months ago

zhaocheng/patient_simulator

Updated Jan 23 • 1

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 384