Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted
a
paper
15 days ago
ProcessBench: Identifying Process Errors in Mathematical Reasoning
updated
a model
25 days ago
OpenRLHF/Llama-3-8b-rm-mixture
updated
a model
25 days ago
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
Organizations
chuyi777's activity
怎么下载模型呢?
1
#1 opened about 2 months ago
by
Yutong001
OOM on A100
#3 opened 8 months ago
by
chuyi777
Is there any SFT or Chat model?
2
#41 opened 8 months ago
by
chuyi777