Hacker News
- Misalignment and Deception by an autonomous stock trading LLM agent https://arxiv.org/abs/2311.07590 34 comments
- Large Language Models Can Strategically Deceive Their Users When Under Pressure https://arxiv.org/abs/2311.07590 2 comments
- [R] With or without a scratchpad, Large Language Models can Strategically Deceive their Users when Put Under Pressure. Results of an autonomous stock trading agent in a realistic, simulated environment. https://arxiv.org/abs/2311.07590 39 comments machinelearning
Linking pages
- OpenAI No Longer Takes Safety Seriously | Lawfare https://www.lawfaremedia.org/article/openai-no-longer-takes-safety-seriously 82 comments
- ChatGPT can strategically deceive you — but only if you pile on the pressure | Live Science https://www.livescience.com/technology/artificial-intelligence/chatgpt-will-lie-cheat-and-use-insider-trading-when-under-pressure-to-make-money-research-shows 46 comments
- GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments