- True Off-Policy Policy-Gradient algorithm https://arxiv.org/abs/1811.09013 6 comments reinforcementlearning
- Implementing Reinforcement learning policy gradient algorithms in Matlab https://www.reddit.com/r/matlab/comments/9m5mgk/implementing_reinforcement_learning_policy/ 8 comments matlab
- A Closer Look at Invalid Action Masking in Policy Gradient Algorithms https://costa.sh/blog-a-closer-look-at-invalid-action-masking-in-policy-gradient-algorithms.html 24 comments reinforcementlearning
- UberAI: Genetic algorithms can solve deep reinforcement learning problems as well as popular alternatives, such as deep Q-learning and policy gradients. https://arxiv.org/abs/1712.06560 7 comments reinforcementlearning