Abdullah

amirali1985

AI & ML interests

Mechanistic interpretability, high dimensional geometry, persona role playing.

Recent Activity

updated a collection 2 days ago
activations_steering
published a dataset 2 days ago
amirali1985/llama3.2-1B-it_power_seeking_layer10
View all activity

Organizations

Thoughtworks's profile picture Apart Research's profile picture Martian's profile picture nlp-and-interpretability's profile picture Backdoors research's profile picture PhillipsLab's profile picture TailsResearch's profile picture Flocker AI's profile picture