Chi Tran

ambivalent02

AI & ML interests

LLM, RAG, VLLM

Recent Activity

Organizations

None yet

ambivalent02's activity

upvoted an article 13 days ago
view article
Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
76
New activity in HuanjinYao/Mulberry_qwen2vl_7b about 1 month ago

2b checkpoint

#1 opened about 1 month ago by
ambivalent02