Zhaolin Gao

GitBag

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset 37 minutes ago
GitBag/1744529405
updated a dataset about 7 hours ago
GitBag/1744529623
updated a dataset about 7 hours ago
GitBag/1744529624

Organizations

Cornell-AGI

Articles 1

RLHF 101: A Technical Dive into RLHF