Datasets with reasoning traces for math and code (Train + Eval)
Maojia Song
OrangeEye

·
AI & ML interests
None yet
Recent Activity
updated
a collection
4 days ago
Long Reasoning
upvoted
an
article
4 days ago
The N Implementation Details of RLHF with PPO
published
a model
5 days ago
OrangeEye/Qwen2.5-1.5B-Knowledge-R1-GRPO
Organizations
Collections
1
spaces
1
models
4
datasets
None public yet