
Maojia Song

OrangeEye

AI & ML interests

None yet

Recent Activity

updated a collection 5 days ago
Long Reasoning
published a model 5 days ago
OrangeEye/Qwen2.5-1.5B-Knowledge-R1-GRPO

Organizations

Deep Cognition and Language Research (DeCLaRe) Lab

OrangeEye's activity

upvoted an article 5 days ago
Article

The N Implementation Details of RLHF with PPO

• 45