Learning Montezuma's Revenge from a Single Demonstration - discu.eu

Hacker News

Learning ‘Montezuma’s Revenge’ from a single demonstration https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/ 45 comments 4/7/2018

Reddit

"Learning Montezuma's Revenge from a Single Demonstration", Salimans & Chen {OA} [PPO with backward chaining from reward state for curriculum learning] https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/ 3 comments 4/7/2018 reinforcementlearning