Yurun Yuan
RyanYr
Recent Activity
updated a model about 1 hour ago: RyanYr/numina-qwen2.5math-7Bbase-ppo-mbs32_critic
updated a model about 1 hour ago: RyanYr/numina-qwen2.5math-7Bbase-ppo-mbs32_actor
published a model about 12 hours ago: RyanYr/numina-qwen2.5math-7Bbase-ppo-mbs32_critic
Collections
2
models
254
RyanYr/numina-qwen2.5math-7Bbase-ppo-mbs32_critic
RyanYr/numina-qwen2.5math-7Bbase-ppo-mbs32_actor
RyanYr/countdown-qwen2.5-0.5B-brm
RyanYr/countdown-qwen2.5-0.5B-ppo_actor • 2
RyanYr/countdown-qwen2.5-0.5B-ppo_critic
RyanYr/countdown-qwen2.5-3B-PPOIV • 4
RyanYr/countdown-qwen2.5-3B-PPOIV_RWSHP • 4
RyanYr/brm-numina-qwen2.5math-7B-base-lr5e-7-mbs16-beta0.001 • 2
RyanYr/brmrwshp-numina-qwen2.5math-7B-base-lr5e-7-beta0.001 • 2
RyanYr/qlearn_rwshp-countdown-qwen2.5-3B-5e-7-b.1-rb • 2
datasets
765
RyanYr/tutor-critic_samples • 50 • 20
RyanYr/tutor-critic_llama-3.1-8b-instruct-evals-math-prm • 50 • 23
RyanYr/tutor-critic_llama-3.1-8b-instruct-evals-math-text_feedback • 50 • 29
RyanYr/tutor-critic_llama-3.1-8b-instruct-evals-math-rm • 50 • 29
RyanYr/tutor-critic_llama-3.1-8b-instruct-evals-math • 10k • 34
RyanYr/Qwen2.5-Math-7B_matheval • 1.52k • 124
RyanYr/brm-numia-qwen2.5math-7B-base-lr4e-7_matheval • 1.52k • 67
RyanYr/Qwen2.5-7B-DPO-Zero_matheval • 1.52k • 46
RyanYr/RLHFlowOnlineDPOPPOZero_matheval • 1.52k • 54
RyanYr/simpleRLZero_matheval • 1.52k • 51