Phillip Guo

PhillipGuo

AI & ML interests

Interp, Unlearning, Editing

Recent Activity

Organizations

Truthfulness & Deception Research Team's profile picture quirky-lats-at-mats's profile picture LLM Latent Adversarial Training's profile picture

PhillipGuo's activity