Unggi Lee

codingchild2424

AI & ML interests

knowledge tracing

Recent Activity

upvoted a paper about 14 hours ago

Retrieve, Don't Retrain: Extending Vision Language Action Models to New Tasks at Test Time

Paper • 2606.15631 • Published 3 days ago • 12

updated a collection about 1 month ago

Special-R1

9 items • Updated May 5

updated a model about 1 month ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2

Text Generation • 8B • Updated May 4 • 4

published a model about 1 month ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2

Text Generation • 8B • Updated May 4 • 4

updated a collection 2 months ago

Special-R1

9 items • Updated May 5

updated a model 2 months ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward

Text Generation • 8B • Updated Apr 17 • 6

published a model 2 months ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward

Text Generation • 8B • Updated Apr 17 • 6

updated a collection 2 months ago

Special-R1

9 items • Updated May 5

updated a model 2 months ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward

Text Generation • 8B • Updated Apr 7 • 1

published a model 2 months ago

OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward

Text Generation • 8B • Updated Apr 7 • 1

updated a collection 3 months ago

Special-R1

9 items • Updated May 5

updated a collection 4 months ago

Special-R1

9 items • Updated May 5

updated 2 collections 5 months ago

Special-R1

9 items • Updated May 5

PedagogyRL-Experiments

5 items • Updated Mar 2

updated a model 5 months ago

OpenLearnLM/qwen2.5_7b_nothink_noreward_grpo_step_300

8B • Updated Jan 13 • 26

published a model 5 months ago

OpenLearnLM/qwen2.5_7b_nothink_noreward_grpo_step_300

8B • Updated Jan 13 • 26