[2308.14752] AI Deception: A Survey of Examples, Risks, and Potential Solutions - discu.eu

Reddit

"AI Deception: A Survey of Examples, Risks, and Potential Solutions", Park et al 2023 https://arxiv.org/abs/2308.14752 (6 comments, 3/6/2024, reinforcementlearning)

Linking pages

AI systems have learned how to deceive humans. What does that mean for our future? https://theconversation.com/ai-systems-have-learned-how-to-deceive-humans-what-does-that-mean-for-our-future-212197 1 comment
Sycophancy in Generative-AI Chatbots https://www.nngroup.com/articles/sycophancy-generative-ai-chatbots/ 0 comments
GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments
GitHub - alopatenko/LLMEvaluation: A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods. https://github.com/alopatenko/LLMEvaluation 0 comments
GitHub - dair-ai/ML-Papers-of-the-Week: 🔥Highlighting the top ML papers every week. https://github.com/dair-ai/ML-Papers-of-the-Week 0 comments
