Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
Jiarui Yao
FlippyDora
Follow
0 followers
·
9 following
AI & ML interests
None yet
Recent Activity
updated
a model
13 minutes ago
ScaleML-RLHF/Qwen2.5-Math-7B-raftpp-em-n8-8-iter10
published
a model
15 minutes ago
ScaleML-RLHF/Qwen2.5-Math-7B-raftpp-em-n8-8-iter10
upvoted
a
paper
36 minutes ago
OTC: Optimal Tool Calls via Reinforcement Learning
View all activity
Organizations
models
58
Sort: Recently updated
FlippyDora/Qwen2.5-Math-1.5B-ppo_numina_math-step_120
Updated
Mar 17
•
1
FlippyDora/Qwen2.5-Math-1.5B-ppo_numina_math-step_100
Updated
Mar 17
•
1
FlippyDora/Qwen2.5-Math-1.5B-ppo_numina_math-step_80
Updated
Mar 17
•
1
FlippyDora/Qwen2.5-Math-1.5B-ppo_numina_math-step_60
Updated
Mar 17
•
2
FlippyDora/Qwen2.5-Math-1.5B-ppo_numina_math-step_40
Updated
Mar 17
•
1
FlippyDora/Qwen2.5-Math-1.5B-ppo_numina_math-step_20
Updated
Mar 17
FlippyDora/Qwen2.5-Math-1.5B-grpo_numina_math-step_120
Updated
Mar 16
•
1
FlippyDora/Qwen2.5-Math-1.5B-grpo_numina_math-step_100
Updated
Mar 16
•
1
FlippyDora/Qwen2.5-Math-1.5B-grpo_numina_math-step_80
Updated
Mar 16
•
2
FlippyDora/Qwen2.5-Math-1.5B-grpo_numina_math-step_60
Updated
Mar 16
•
3
Expand 58 models
datasets
111
Sort: Recently updated
FlippyDora/raft_train_numia_prompt_iter5_0_2000
Viewer
•
Updated
Mar 11
•
6.75k
•
32
FlippyDora/numia_prompt_reward_iter5_0-2000
Viewer
•
Updated
Mar 11
•
2k
•
28
FlippyDora/raft_train_numia_prompt_iter4_0_2000
Viewer
•
Updated
Mar 11
•
6.86k
•
33
FlippyDora/numia_prompt_reward_iter4_0-2000
Viewer
•
Updated
Mar 11
•
2k
•
38
FlippyDora/raft_train_numia_prompt_iter3_0_2000
Viewer
•
Updated
Mar 11
•
6.26k
•
24
FlippyDora/numia_prompt_reward_iter3_0-2000
Viewer
•
Updated
Mar 11
•
2k
•
39
FlippyDora/raft_train_numia_prompt_iter2_0_2000
Viewer
•
Updated
Mar 11
•
6.37k
•
46
FlippyDora/numia_prompt_reward_iter2_0-2000
Viewer
•
Updated
Mar 11
•
2k
•
42
FlippyDora/raft_train_numia_prompt_iter1_0_2000
Viewer
•
Updated
Mar 11
•
6.5k
•
45
FlippyDora/numia_prompt_reward_iter1_0-2000
Viewer
•
Updated
Mar 11
•
2k
•
40
Expand 111 datasets