arxiv:2407.14561
Arnab Sen Sharma
sensharma
AI & ML interests
How LLMs represent knowledge and how they extract specific information from a latent? Can we perform update/delete/insert a concept into LLM in an human-interpretable way?
Recent Activity
updated
a model
29 days ago
sensharma/sae_llama-1B_layer-8_wiki-2M
updated
a model
29 days ago
sensharma/sae-disent_llama-1B_layer-8_wiki-2M
updated
a model
about 2 months ago
sensharma/Llama-3.2-3B_Talkative_Probe_Step-3000
Organizations
None yet