Linking pages
- Gemma Scope: helping the safety community shed light on the inner workings of language models - Google DeepMind https://deepmind.google/discover/blog/gemma-scope-helping-the-safety-community-shed-light-on-the-inner-workings-of-language-models/ 4 comments
- GitHub - JShollaj/awesome-llm-interpretability: A curated list of Large Language Model (LLM) Interpretability resources. https://github.com/JShollaj/awesome-llm-interpretability 1 comment
Related searches:
Search whole site: site:arxiv.org
Search title: [2310.06824] The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
See how to search.