[2404.16014] Improving Dictionary Learning with Gated Sparse Autoencoders

Linking pages

Mapping the Mind of a Large Language Model \ Anthropic https://www.anthropic.com/news/mapping-mind-language-model 2 comments
Mapping the Mind of a Large Language Model \ Anthropic https://www.anthropic.com/research/mapping-mind-language-model 1 comment
Prism: mapping interpretable concepts and features in a latent space of language | thesephist.com https://thesephist.com/posts/prism/ 1 comment
An Intuitive Explanation of Sparse Autoencoders for Mechanistic Interpretability of LLMs | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/06/11/sae-intuitions.html 0 comments
