- [P] I reproduced Anthropic's recent interpretability research https://jakeward.substack.com/p/monosemanticity-at-home-my-attempt 31 comments machinelearning
Linked pages
- GitHub - karpathy/nanoGPT: The simplest, fastest repository for training/finetuning medium-sized GPTs. https://github.com/karpathy/nanoGPT 366 comments
- God Help Us, Let's Try To Understand The Paper On AI Monosemanticity https://www.astralcodexten.com/p/god-help-us-lets-try-to-understand 205 comments
- Let's build GPT: from scratch, in code, spelled out. - YouTube https://www.youtube.com/watch?v=kCc8FmEb1nY 105 comments
- Towards Monosemanticity: Decomposing Language Models With Dictionary Learning https://transformer-circuits.pub/2023/monosemantic-features/index.html 5 comments
- Toy Models of Superposition https://transformer-circuits.pub/2022/toy_model/index.html 4 comments
- Neural Networks From Scratch - victorzhou.com https://victorzhou.com/series/neural-networks-from-scratch/ 1 comment
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:jakeward.substack.com
Search title: Monosemanticity at Home: My Attempt at Replicating Anthropic's Interpretability Research from Scratch
See how to search.