57 16 119

chansung park PRO

chansung

AI & ML interests

None yet

Recent Activity

updated a model about 6 hours ago

chansung/Qwen2.5-7B-CCRL-1

published a model 1 day ago

chansung/Qwen2.5-7B-CCRL-1

updated a model 2 days ago

chansung/Qwen2.5-1.5B-CCRL-1

View all activity

Organizations

Posts 20

Post

3399

simple guide on the recipe for GRPO on Open-R1 which is built on top of TRL

I think FastAPI wrapper of vLLM with WeightSyncWorker is pretty cool feature. Also, we have many predefined reward functions out of the box!

View all Posts

Articles 5

Article

Explore papers with auto generated Q&As

pinned

Runtime error

142

Llama2 With Gradio Chat

Zero2Story

Co Write With Llama2

✍

pinned

Runtime error

LLMs As Chatbot

🦙

No application file

Adaptsum

📊

models 92

chansung/Qwen2.5-7B-CCRL-1

Updated about 6 hours ago

chansung/Qwen2.5-1.5B-CCRL-1

Text Generation • Updated 2 days ago • 6

chansung/Qwen2.5-1.5B-CCRL-2

Text Generation • Updated 9 days ago • 1

chansung/Qwen2.5-1.5B-CRL-Code-GRPO-exp1

Updated 11 days ago

chansung/Qwen2.5-1.5B-Coder-CRL-GRPO-exp1

Updated 11 days ago

chansung/Qwen2.5-1.5B-Instruct-CRL-Open-R1-Code-GRPO-exp1

Updated 11 days ago

chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO-exp1

Text Generation • Updated 11 days ago • 2

chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO

Updated 11 days ago • 1

chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated 12 days ago • 1

chansung/Qwen2.5-1.5B-Open-R1-GRPO

Updated 17 days ago

datasets 58

chansung/verifiable-coding-problems-python

Viewer • Updated 13 days ago • 949 • 84

chansung/openthoughts-coding-llama-factory

Viewer • Updated 30 days ago • 19.9k • 65

chansung/cqa_synth_ds

Viewer • Updated Jun 3, 2024 • 111k • 54

chansung/coding_synth_ds

Viewer • Updated Jun 3, 2024 • 116k • 44 • 1

chansung/classification_synth_ds

Viewer • Updated Jun 2, 2024 • 92.3k • 68

chansung/classification_synth_ds2

Viewer • Updated Jun 1, 2024 • 424 • 32

chansung/aaa3

Updated Jun 1, 2024 • 5

chansung/aaa2

Updated Jun 1, 2024 • 5

chansung/synth_summarize_dataset

Viewer • Updated May 31, 2024 • 880k • 110

chansung/new_summarize_synth_ds3

Viewer • Updated May 31, 2024 • 301k • 71

chansung park PRO

AI & ML interests

Recent Activity

Organizations

Posts 20

Articles 5

Distilling from Dialogues: Finding Meaning in LLM Interactions

Papers 2

spaces 43 Sort: Recently updated

Paper Q&A

Llama2 With Gradio Chat

Zero2Story

Co Write With Llama2

LLMs As Chatbot

Adaptsum

models 92 Sort: Recently updated

datasets 58 Sort: Recently updated

spaces 43

models 92

datasets 58