- "AI Deception: A Survey of Examples, Risks, and Potential Solutions", Park et al. 2023 https://arxiv.org/abs/2308.14752
Linking pages
- AI systems have learned how to deceive humans. What does that mean for our future? https://theconversation.com/ai-systems-have-learned-how-to-deceive-humans-what-does-that-mean-for-our-future-212197
- Sycophancy in Generative-AI Chatbots https://www.nngroup.com/articles/sycophancy-generative-ai-chatbots/
- GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list
- GitHub - alopatenko/LLMEvaluation: A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods. https://github.com/alopatenko/LLMEvaluation
- GitHub - dair-ai/ML-Papers-of-the-Week: Highlighting the top ML papers every week. https://github.com/dair-ai/ML-Papers-of-the-Week