Hacker News
- What Anthropic Researchers Found After Reading Claude's 'Mind' Surprised Them https://singularityhub.com/2025/03/28/what-anthropic-researchers-found-after-reading-claudes-mind-surprised-them/ 0 comments
Linked pages
- https://www.anthropic.com/research/tracing-thoughts-language-model 398 comments
- Circuit Tracing: Revealing Computational Graphs in Language Models https://transformer-circuits.pub/2025/attribution-graphs/methods.html 27 comments
- On the Biology of a Large Language Model https://transformer-circuits.pub/2025/attribution-graphs/biology.html 9 comments
- [2305.04388] Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting https://arxiv.org/abs/2305.04388 0 comments
Related searches:
Search whole site: site:singularityhub.com
Search title: What Anthropic Researchers Found After Reading Claude’s ‘Mind’ Surprised Them
See how to search.