discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Hacker News
Learning ‘Montezuma’s Revenge’ from a single demonstration
https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/
45 comments
4/7/2018
Reddit
"Learning Montezuma's Revenge from a Single Demonstration", Salimans & Chen {OA} [PPO with backward chaining from reward state for curriculum learning]
https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/
3 comments
4/7/2018
reinforcementlearning