Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Seungyong Moon
symoon11
Follow
0 followers
·
1 following
https://symoon11.github.io/
symoon11
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
1 day ago
symoon11/Qwen2.5-1.5B-Open-R1-Distill
published
a model
1 day ago
symoon11/Qwen2.5-1.5B-Open-R1-GRPO
published
a model
1 day ago
symoon11/Qwen2.5-1.5B-Open-R1-Distill
View all activity
Organizations
None yet
Papers
2
arxiv:
2410.02992
arxiv:
2307.03486
models
3
Sort: Recently updated
symoon11/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
1 day ago
symoon11/Qwen2.5-1.5B-Open-R1-GRPO
Updated
1 day ago
symoon11/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
1 day ago
datasets
None public yet