Xiaobo Wang
Yofuria
AI & ML interests
LLMs, Alignment, CL
Recent Activity
upvoted an article 16 days ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
updated a dataset about 1 month ago
Yofuria/mistral-instruct-ultrafeedback_multi_pairs