Jian Hu's picture

Jian Hu

chuyi777

·

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper 6 days ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

liked a dataset about 2 months ago

open-r1/OpenR1-Math-220k

liked a dataset about 2 months ago

open-thoughts/OpenThoughts-114k

View all activity

Organizations

chuyi777's activity

commented a paper 3 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99 •

New activity in OpenRLHF/Mistral-7b-PRM-Math-Shepherd 6 months ago

怎么下载模型呢？

#1 opened 6 months ago by