- Why don't policies over large action spaces also have to "optimize"? https://arxiv.org/abs/1509.02971 7 comments reinforcementlearning
- Why do rewards start to drop after a certain number of episodes? https://arxiv.org/abs/1509.02971 8 comments reinforcementlearning
Linking pages
- GitHub - terryum/awesome-deep-learning-papers: The most cited deep learning papers https://github.com/terryum/awesome-deep-learning-papers 47 comments
- Learning to Cooperate, Compete, and Communicate https://blog.openai.com/learning-to-cooperate-compete-and-communicate/?source=hn 36 comments
- Reinforcement Learning (DQN) Tutorial — PyTorch Tutorials 2.0.0+cu117 documentation https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html 29 comments
- Deep-Learning-Papers-Reading-Roadmap/README.md at master · floodsung/Deep-Learning-Papers-Reading-Roadmap · GitHub https://github.com/songrotek/deep-learning-papers-reading-roadmap 29 comments
- ROBEL: Robotics Benchmarks for Learning with Low-Cost Robots – Google AI Blog https://ai.googleblog.com/2019/10/robel-robotics-benchmarks-for-learning.html 25 comments
- GitHub - astorfi/Deep-Learning-Roadmap: Organized Resources for Deep Learning Researchers and Developers https://github.com/astorfi/Deep-Learning-World 22 comments
- GitHub - dennybritz/reinforcement-learning: Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course. https://github.com/dennybritz/reinforcement-learning 20 comments
- GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
- TDM: From Model-Free to Model-Based Deep Reinforcement Learning – The Berkeley Artificial Intelligence Research Blog http://bair.berkeley.edu/blog/2018/04/26/tdm/ 11 comments
- GitHub - marload/DeepRL-TensorFlow2: 🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2 https://github.com/marload/deep-rl-tf2 10 comments
- GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning https://github.com/keiohta/tf2rl 9 comments
- GitHub - p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch: PyTorch implementations of deep reinforcement learning algorithms and environments https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch 9 comments
- GitHub - chaovven/PyRL: PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) https://github.com/chaovven/PyRL 8 comments
- GitHub - ghliu/pytorch-ddpg: Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch https://github.com/ghliu/pytorch-ddpg 8 comments
- Computational Missile Guidance: A Deep Reinforcement Learning Approach | Journal of Aerospace Information Systems https://arc.aiaa.org/doi/10.2514/1.I010970 7 comments
- Introducing Huskarl: The Modular Deep Reinforcement Learning Framework | by TensorFlow | Medium https://medium.com/@tensorflow/introducing-huskarl-the-modular-deep-reinforcement-learning-framework-e47d4b228dd3 6 comments
- multi-agent-rl/README.md at master · rohan-sawhney/multi-agent-rl · GitHub https://github.com/rohan-sawhney/multi-agent-rl/blob/master/README.md 5 comments
- GitHub - floodsung/Deep-Learning-Papers-Reading-Roadmap: Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech! https://github.com/floodsung/Deep-Learning-Papers-Reading-Roadmap 5 comments
- GitHub - medipixel/rl_algorithms: Structural implementation of RL key algorithms https://github.com/medipixel/rl_algorithms 5 comments
- Long-Range Robotic Navigation via Automated Reinforcement Learning – Google AI Blog https://ai.googleblog.com/2019/02/long-range-robotic-navigation-via.html 3 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1509.02971] Continuous control with deep reinforcement learning