Qiying Yu
qiying
AI & ML interests
None yet
Recent Activity
authored
a paper
21 days ago
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
upvoted
a
paper
21 days ago
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
updated
a dataset
22 days ago
BytedTsinghua-SIA/DAPO-Math-17k
Organizations
qiying's activity
About the Data Generation Method
2
#4 opened 8 months ago
by
qiying
How do you use the rationales and answers in your training?
2
#1 opened 11 months ago
by
qiying
How do you use the rationales and answers in the arxivqa training?
#1 opened 11 months ago
by
qiying
Training Hyperparameters
#6 opened about 1 year ago
by
qiying
Training Code Sharing
#1 opened over 1 year ago
by
qiying