Linking pages
Linked pages
- Transformer Circuits Thread https://transformer-circuits.pub/ 8 comments
- [1703.04730] Understanding Black-box Predictions via Influence Functions https://arxiv.org/abs/1703.04730 4 comments
- [2308.03296] Studying Large Language Model Generalization with Influence Functions https://arxiv.org/abs/2308.03296 2 comments
- Anthropic \ Measuring Faithfulness in Chain-of-Thought Reasoning https://www.anthropic.com/index/measuring-faithfulness-in-chain-of-thought-reasoning 0 comments
Related searches:
Search whole site: site:www.anthropic.com
Search title: Anthropic \ Tracing Model Outputs to the Training Data
See how to search.