- "Deep Reinforcement Learning that Matters", Henderson et al 2017 [on reproducible & statistically-significant comparisons: high variance in deep RL due to random seeds, hyperparameter optimization, performance metric choices, implementation] https://arxiv.org/abs/1709.06560 3 comments reinforcementlearning
Linking pages
- Deep Reinforcement Learning Doesn't Work Yet https://www.alexirpan.com/2018/02/14/rl-hard.html 89 comments
- The Turing Bot | The Topics I Would Choose If I Ever Did A PhD in AI/ML https://turing-bot.com/posts/masters-degree-new-learn 12 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail 7 comments
- Our NIPS 2017: Learning to Run approach | by Adam Stelmaszczyk | ML Review https://medium.com/@stelmaszczykadam/our-nips-2017-learning-to-run-approach-b80a295d3bb5 4 comments
- Clues for Which I Search and Choose – arg min blog http://benjamin-recht.github.io/2018/03/20/mujocoloco/ 2 comments
- Make It Happen – arg min blog http://www.argmin.net/2018/01/29/taxonomy/ 0 comments
- Clues for Which I Search and Choose – arg min blog http://www.argmin.net/2018/03/20/mujocoloco/ 0 comments
- RLiable: Towards Reliable Evaluation & Reporting in Reinforcement Learning – Google AI Blog https://ai.googleblog.com/2021/11/rliable-towards-reliable-evaluation.html 0 comments
- Reinforcement Learning: A Deep Dive | Toptal https://www.toptal.com/machine-learning/deep-dive-into-reinforcement-learning 0 comments
- GitHub - inoryy/reaver: Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo. https://github.com/inoryy/reaver-pysc2 0 comments
- Recursive Classification: Replacing Rewards with Examples in RL – Google AI Blog https://ai.googleblog.com/2021/03/recursive-classification-replacing.html 0 comments
- Learning to Drive Smoothly in Minutes | by Antonin RAFFIN | Towards Data Science https://towardsdatascience.com/learning-to-drive-smoothly-in-minutes-450a7cdb35f4 0 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr 0 comments
- A solution to the Deep Learning reproducibility crisis | by NuronLabs | Medium https://medium.com/@nuronlabs/a-solution-to-the-deep-learning-reproducibility-crisis-3c760703e214 0 comments
- Unawareness of Deep Learning Mistakes | by Yuxin Wu | Medium https://medium.com/@ppwwyyxx/unawareness-of-deep-learning-mistakes-d5b5774da0ba 0 comments
- Opportunities in Deep Learning. Although the Garter hype cycle believes… | by NuronLabs | Medium https://medium.com/@nuronlabs/opportunities-in-deep-learning-d7088eedcc47 0 comments
- Making Peace with LLM Non-determinism https://barryzhang.substack.com/p/making-peace-with-llm-non-determinism 0 comments