Hanning Zhang

HanningZhang

AI & ML interests

None yet

Recent Activity

updated a model about 10 hours ago

HanningZhang/Qwen2.5-Math-7B-raft-plusplus_cliphigher050_em-iter3

published a model about 10 hours ago

HanningZhang/Qwen2.5-Math-7B-raft-plusplus_cliphigher050_em-iter3

updated a model about 17 hours ago

HanningZhang/Qwen2.5-Math-7B-raft-plusplus_cliphigher050_em-iter2

HanningZhang's activity

upvoted a paper 6 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 7 days ago • 86

upvoted a paper about 2 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84

upvoted a collection 6 months ago

RLHFlow MATH Process Reward Model

This is a collection of datasets and models of process reward modeling. • 15 items • Updated Nov 9, 2024 • 10