- Against policy gradients/REINFORCE http://www.argmin.net/2018/02/20/reinforce/ 4 comments reinforcementlearning
Linking pages
- An Outsider's Tour of Reinforcement Learning – arg min blog http://www.argmin.net/2018/06/25/outsider-rl/ 9 comments
- A Model, You Know What I Mean? – arg min blog http://www.argmin.net/2018/02/26/nominal/ 3 comments
- An Outsider's Tour of Reinforcement Learning – arg min blog http://benjamin-recht.github.io/2018/06/25/outsider-rl/ 0 comments
Linked pages
- OpenAI Baselines: DQN https://blog.openai.com/openai-baselines-dqn/ 36 comments
- A Model, You Know What I Mean? – arg min blog http://www.argmin.net/2018/02/26/nominal/ 3 comments
- Make It Happen – arg min blog http://www.argmin.net/2018/01/29/taxonomy/ 0 comments
- GitHub - HIPS/autograd: Efficiently computes derivatives of numpy code. https://github.com/HIPS/autograd 0 comments
Related searches:
Search whole site: site:www.argmin.net
Search title: The Policy of Truth – arg min blog