Linking pages
Linked pages
- https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii/ 508 comments
- Sokoban - Wikipedia https://en.wikipedia.org/wiki/Sokoban 59 comments
- Faulty Reward Functions in the Wild https://openai.com/blog/faulty-reward-functions 17 comments
- https://arxiv.org/pdf/1805.12114.pdf 5 comments
- Baba Is You https://hempuli.com/baba/ 4 comments
- [1606.06565] Concrete Problems in AI Safety https://arxiv.org/abs/1606.06565 3 comments
- Specification gaming examples in AI | Victoria Krakovna https://vkrakovna.wordpress.com/2018/04/02/specification-gaming-examples-in-ai/ 1 comment
- https://deepmind.com/blog/specifying-ai-safety-problems/ 1 comment
- GOEDEL MACHINE HOME PAGE https://people.idsia.ch/~juergen/goedelmachine.html 1 comment
- Understanding Agent Incentives with Causal Influence Diagrams | by DeepMind Safety Research | Medium https://medium.com/@deepmindsafetyresearch/understanding-agent-incentives-with-causal-influence-diagrams-7262c2512486 1 comment
- Scalable agent alignment via reward modeling | by DeepMind Safety Research | Medium https://medium.com/@deepmindsafetyresearch/scalable-agent-alignment-via-reward-modeling-bf4ab06dfd84 0 comments
- https://deepmind.com/research/alphago/ 0 comments
- [1906.08663] Modeling AGI Safety Frameworks with Causal Influence Diagrams https://arxiv.org/abs/1906.08663 0 comments
- [1711.09883] AI Safety Gridworlds https://arxiv.org/abs/1711.09883 0 comments
Related searches:
Search whole site: site:medium.com
Search title: Designing agent incentives to avoid reward tampering | by DeepMind Safety Research | Medium
See how to search.