Hacker News
- LLMs know more than what they say https://arjunbansal.substack.com/p/llms-know-more-than-what-they-say 18 comments
Linked pages
- https://openai.com/index/extracting-concepts-from-gpt-4/ 143 comments
- [2212.03827] Discovering Latent Knowledge in Language Models Without Supervision https://arxiv.org/abs/2212.03827 86 comments
- Golden Gate Claude \ Anthropic https://www.anthropic.com/news/golden-gate-claude 66 comments
- Transformer Circuits Thread https://transformer-circuits.pub/ 8 comments
- Mapping the Mind of a Large Language Model \ Anthropic https://www.anthropic.com/news/mapping-mind-language-model 2 comments
- The Shift from Models to Compound AI Systems – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2024/02/18/compound-ai-systems/ 1 comment
- [2311.06668] In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering https://arxiv.org/abs/2311.06668 0 comments
- Simple probes can catch sleeper agents \ Anthropic https://www.anthropic.com/research/probes-catch-sleeper-agents 0 comments
- Aligning LLM-as-a-Judge with Human Preferences https://blog.langchain.dev/aligning-llm-as-a-judge-with-human-preferences/ 0 comments
- [2406.00975] Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost https://arxiv.org/abs/2406.00975 0 comments
Related searches:
Search whole site: site:arjunbansal.substack.com
Search title: LLMs Know More Than What They Say - by Ruby Pai
See how to search.