Richard Ren

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

Recent Activity

Updated a dataset 1 day ago: cais/MASK
New activity 2 days ago: cais/MASK (Update README.md)
Published a model about 1 month ago: notrichardren/lorra_tqa_7b

Organizations

Center for AI Safety
Truthfulness & Deception Research Team
Robust Control