SAE-Reasoning Collection Models and datasets used in the paper "Interpreting Reasoning Features in Large Language Models via Sparse Autoencoder": https://arxiv.org/abs/2503.188 • 4 items • Updated 22 days ago
andreuka18/DeepSeek-R1-Distill-Qwen-7B-lmsys-openthoughts-tokenized Viewer • Updated 23 days ago • 781k • 196
andreuka18/DeepSeek-R1-Distill-Qwen-7B-lmsys-chat-1m-tokenized Viewer • Updated 23 days ago • 486k • 127
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Paper • 2503.18878 • Published 29 days ago • 117