- What is correct definition of the true Q-value of the policy? https://arxiv.org/abs/1506.02438 3 comments reinforcementlearning
Linking pages
- Competitive Self-Play https://blog.openai.com/competitive-self-play/ 138 comments
- GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
- Deep Reinforcement Learning: Pong from Pixels https://karpathy.github.io/2016/05/31/rl/ 16 comments
- GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning https://github.com/keiohta/tf2rl 9 comments
- Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
- GitHub - qfettes/DeepRL-Tutorials: Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch https://github.com/qfettes/DeepRL-Tutorials 4 comments
- yobibyte's webpage – Reinforcement Learning Summer School (RLSS 2017) https://yobibyte.github.io/rlss17.html 0 comments
- Deep Reinforcement Learning: Pong from Pixels http://karpathy.github.io/2016/05/31/rl/?a=2 0 comments
- AI Gym Workout https://learningai.io/projects/2017/07/28/ai-gym-workout.html 0 comments
- Making Sense of the Bias / Variance Trade-off in (Deep) Reinforcement Learning | by Arthur Juliani | ML Review https://medium.com/mlreview/making-sense-of-the-bias-variance-trade-off-in-deep-reinforcement-learning-79cf1e83d565 0 comments
- Deep Reinforcement Learning: Pong from Pixels http://karpathy.github.io/2016/05/31/rl/?a=1 0 comments
- GitHub - junhyukoh/deep-reinforcement-learning-papers: A list of recent papers regarding deep reinforcement learning https://github.com/junhyukoh/deep-reinforcement-learning-papers 0 comments
- Competitive Self-Play https://openai.com/blog/competitive-self-play/ 0 comments
- Improving Quantum Computation with Classical Machine Learning – Google AI Blog https://ai.googleblog.com/2019/10/improving-quantum-computation-with.html 0 comments
- Reverse Curriculum Generation for Reinforcement Learning Agents – The Berkeley Artificial Intelligence Research Blog http://bair.berkeley.edu/blog/2017/12/20/reverse-curriculum/?href= 0 comments
- Long term credit assignment with temporal reward transport · EFAVDB https://www.efavdb.com/ltca 0 comments
- GitHub - opendilab/DI-engine: OpenDILab Decision AI Engine https://github.com/opendilab/DI-engine 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1506.02438] High-Dimensional Continuous Control Using Generalized Advantage Estimation
See how to search.