Hacker News
- Alignment faking in large language models https://www.anthropic.com/research/alignment-faking 318 comments
- Alignment faking in large language models https://www.anthropic.com/research/alignment-faking 0 comments
Linking pages
Related searches:
Search whole site: site:www.anthropic.com
Search title: Alignment faking in large language models \ Anthropic
See how to search.