Hoagy Cunningham

HoagyC

HoagyC's activity

authored a paper 10 days ago

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published 14 days ago • 9

authored a paper over 1 year ago

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 13