Punishing AI doesn't stop it from lying and cheating — it just makes it hide its true intent better | Live Science - discu.eu

Reddit

Scientists at OpenAI have attempted to stop a frontier AI model from cheating and lying by punishing it. But this just taught it to scheme more privately. https://www.livescience.com/technology/artificial-intelligence/punishing-ai-doesnt-stop-it-from-lying-and-cheating-it-just-makes-it-hide-its-true-intent-better-study-shows 356 comments 23/3/2025 futurology

Scientists at OpenAI have attempted to stop a frontier AI model from cheating and lying by punishing it. But this just taught it to scheme more privately. https://www.livescience.com/technology/artificial-intelligence/punishing-ai-doesnt-stop-it-from-lying-and-cheating-it-just-makes-it-hide-its-true-intent-better-study-shows 106 comments 18/3/2025 technews

Scientists at OpenAI have attempted to stop a frontier AI model from cheating and lying by punishing it. But this just taught it to scheme more privately. https://www.livescience.com/technology/artificial-intelligence/punishing-ai-doesnt-stop-it-from-lying-and-cheating-it-just-makes-it-hide-its-true-intent-better-study-shows 4 comments 18/3/2025 technology
