ldwang

AI & ML interests

LLM, MLLM, Infra

Recent Activity

upvoted a collection about 16 hours ago
SimpleRL
updated a collection 3 days ago
MiscModels
liked a model 3 days ago
deepseek-ai/deepseek-vl2-tiny

Organizations

Beijing Academy of Artificial Intelligence, PetiteTech

ldwang's activity

upvoted an article 13 days ago
Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By NormalUhr
updated a Space 20 days ago
published a Space 21 days ago