Hacker News
- Towards Monosemanticity: Decomposing Language Models with Dictionary Learning https://transformer-circuits.pub/2023/monosemantic-features/index.html 5 comments
Linking pages
- God Help Us, Let's Try To Understand The Paper On AI Monosemanticity https://www.astralcodexten.com/p/god-help-us-lets-try-to-understand 205 comments
- GitHub - openai/transformer-debugger https://github.com/openai/transformer-debugger 120 comments
- GitHub - PaulPauls/llama3_interpretability_sae: A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible. https://github.com/PaulPauls/llama3_interpretability_sae 97 comments
- Representation Engineering Mistral-7B an Acid Trip https://vgel.me/posts/representation-engineering/ 75 comments
- Anthropic \ Decomposing Language Models Into Understandable Components https://www.anthropic.com/index/decomposing-language-models-into-understandable-components 62 comments
- AI Is a Black Box. Anthropic Figured Out a Way to Look Inside | WIRED https://www.wired.com/story/anthropic-black-box-ai-research-neurons-features/ 62 comments
- Manipulating Chess-GPT’s World Model | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/03/20/chess-gpt-interventions.html 36 comments
- Monosemanticity at Home: My Attempt at Replicating Anthropic's Interpretability Research from Scratch https://jakeward.substack.com/p/monosemanticity-at-home-my-attempt 31 comments
- Gemma Scope: helping the safety community shed light on the inner workings of language models - Google DeepMind https://deepmind.google/discover/blog/gemma-scope-helping-the-safety-community-shed-light-on-the-inner-workings-of-language-models/ 4 comments
- Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken 3 comments
- GitHub - JShollaj/awesome-llm-interpretability: A curated list of Large Language Model (LLM) Interpretability resources. https://github.com/JShollaj/awesome-llm-interpretability 1 comment
- Prism: mapping interpretable concepts and features in a latent space of language | thesephist.com https://thesephist.com/posts/prism/ 1 comment
- A primer on sparse autoencoders - by Nick Jiang https://nickjiang.substack.com/p/a-primer-on-sparse-autoencoders 1 comment
- Unlocking the “black box” - by Alex Lindsay and Greg Dale https://aipoliticalpulse.substack.com/p/unlocking-the-black-box 0 comments
- Neuroscience is pre-paradigmatic. Consciousness is why https://www.theintrinsicperspective.com/p/neuroscience-is-pre-paradigmatic 0 comments
- The case for open source AI https://press.airstreet.com/p/the-case-for-open-source-ai 0 comments
- Dictionary Learning with Sparse AutoEncoders | Kola Ayonrinde https://www.kolaayonrinde.com/blog/2023/11/03/dictionary-learning.html 0 comments
- Simple probes can catch sleeper agents \ Anthropic https://www.anthropic.com/research/probes-catch-sleeper-agents 0 comments
- The engineering challenges of scaling interpretability \ Anthropic https://www.anthropic.com/research/engineering-challenges-interpretability 0 comments
- An Intuitive Explanation of Sparse Autoencoders for Mechanistic Interpretability of LLMs | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/06/11/sae-intuitions.html 0 comments