Learning Montezuma's Revenge from a Single Demonstration - discu.eu

Hacker News

Learning ‘Montezuma’s Revenge’ from a single demonstration https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/ 45 comments 4/7/2018

Reddit

"Learning Montezuma's Revenge from a Single Demonstration", Salimans & Chen {OA} [PPO with backward chaining from reward state for curriculum learning] https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/ 3 comments 4/7/2018 reinforcementlearning

Linking pages

Linked pages

Related searches:

Search whole site: site:blog.openai.com

Search title: Learning Montezuma's Revenge from a Single Demonstration

See how to search.

Submit link to: