discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Reddit
[P] I reproduced Anthropic's recent interpretability research
https://jakeward.substack.com/p/monosemanticity-at-home-my-attempt
31 comments
1/5/2024
machinelearning