- Two-faced AI models learn to hide deception | Just like people, AI systems can be deliberately deceptive - ‘sleeper agents’ seem helpful during testing but behave differently once deployed https://www.nature.com/articles/d41586-024-00189-3 38 comments futurology