Seungyong Moon

symoon11

https://symoon11.github.io/

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model 1 day ago

symoon11/Qwen2.5-1.5B-Open-R1-Distill

published a model 1 day ago

symoon11/Qwen2.5-1.5B-Open-R1-GRPO

published a model 1 day ago

symoon11/Qwen2.5-1.5B-Open-R1-Distill

Organizations

None yet

Papers 2

arxiv:2410.02992

arxiv:2307.03486

Models 3

symoon11/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated 1 day ago

symoon11/Qwen2.5-1.5B-Open-R1-GRPO

Updated 1 day ago

symoon11/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated 1 day ago

Datasets

None public yet