Hacker News
- Step by step tutorial to understand multi-armed bandit http://karpathy.github.io/2016/05/31/rl/ 2 comments reinforcementlearning
Linking pages
- Concrete AI Safety Problems https://openai.com/blog/concrete-ai-safety-problems/ 94 comments
- Infrastructure for Deep Learning https://openai.com/blog/infrastructure-for-deep-learning/ 67 comments
- #define CTO OpenAI https://blog.gregbrockman.com/define-cto-openai 62 comments
- Evolution as Backstop for Reinforcement Learning · Gwern.net https://www.gwern.net/Backstop#pain-is-the-only-school-teacher 61 comments
- Generative Models https://openai.com/blog/generative-models/ 60 comments
- Lessons Learned Reproducing a Deep Reinforcement Learning Paper http://amid.fish/reproducing-deep-rl 37 comments
- How to Start Learning Deep Learning – Ofir Press http://ofir.io/How-to-Start-Learning-Deep-Learning/ 29 comments
- GitHub - andri27-ts/Reinforcement-Learning: Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning https://github.com/andri27-ts/60_Days_RL_Challenge 22 comments
- ML Resources https://sgfin.github.io/learning-resources/ 21 comments
- Machine Learning is Fun! Part 3: Deep Learning and Convolutional Neural Networks | by Adam Geitgey | Medium https://medium.com/@ageitgey/machine-learning-is-fun-part-3-deep-learning-and-convolutional-neural-networks-f40359318721#.m77iqtjqu 18 comments
- Generative Models https://blog.openai.com/generative-models/ 15 comments
- Using Keras and Deep Deterministic Policy Gradient to play TORCS | Ben Lau https://yanpanlau.github.io/2016/10/11/Torcs-Keras.html 14 comments
- Policy Gradient Reinforcement Learning in PyTorch | by Tim Sullivan | Medium https://medium.com/@ts1829/policy-gradient-reinforcement-learning-in-pytorch-df1383ea0baf#5807 6 comments
- How to Create A Concurrent and Parallel Stochastic Reinforcement Learning Environment For Crypto Trading | by Kevin Hill | Medium https://medium.com/@kevinhill_96608/how-to-create-a-concurrent-and-parallel-stochastic-reinforcement-learning-environment-for-crypto-3756d78b7a8e?sk=f1b3321cbee3f42004eee87285eae27a&source=friends_link 6 comments
- Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
- 4 Steps for Learning Deep Learning | by Vivek Kumar | Medium https://medium.com/@vzkuma/4-steps-for-learning-deep-learning-86f11fcee54#.6nrtkcrn0 2 comments
- ApproxiPong | An informal review of reinforcement learning algorithms using deep learning methods. https://jonathanfiat.github.io/ApproxiPong/ 1 comment
- Double Q-Learning Explained - Lukas's Blog https://loreley.one/2024-03-double_q/ 1 comment
- Reinforcement learning without gradients: evolving agents using Genetic Algorithms | by Paras Chopra | Towards Data Science https://towardsdatascience.com/reinforcement-learning-without-gradients-evolving-agents-using-genetic-algorithms-8685817d84f 0 comments
- A GAMEBOY supercomputer. At a total of slightly over 1 billion… | by Kamil Rocki | Towards Data Science https://towardsdatascience.com/a-gameboy-supercomputer-33a6955a79a4 0 comments
Linked pages
- http://arxiv.org/abs/1410.5401 40 comments
- Berkeley’s Preschool for Robots http://www.bloomberg.com/features/2015-preschool-for-robots/ 32 comments
- [1604.00289] Building Machines That Learn and Think Like People https://arxiv.org/abs/1604.00289 22 comments
- https://gym.openai.com/ 18 comments
- https://arxiv.org/abs/1602.01783 7 comments
- Deep Learning for Robots: Learning from Large-Scale Interaction – Google AI Blog http://googleresearch.blogspot.com/2016/03/deep-learning-for-robots-learning-from.html 5 comments
- [1506.02438] High-Dimensional Continuous Control Using Generalized Advantage Estimation https://arxiv.org/abs/1506.02438 3 comments
- AlphaGo: Mastering the ancient game of Go with Machine Learning – Google AI Blog http://googleresearch.blogspot.com/2016/01/alphago-mastering-ancient-game-of-go.html 2 comments
- [1504.00702] End-to-End Training of Deep Visuomotor Policies http://arxiv.org/abs/1504.00702 1 comment
- https://deepmind.com/alpha-go.html 1 comment
- http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html 0 comments
- [1406.6247] Recurrent Models of Visual Attention http://arxiv.org/abs/1406.6247 0 comments
- [1505.00521] Reinforcement Learning Neural Turing Machines - Revised http://arxiv.org/abs/1505.00521 0 comments
- REINFORCEjs: Gridworld with Dynamic Programming http://cs.stanford.edu/people/karpathy/reinforcejs/index.html 0 comments
- Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning http://www.cs.ubc.ca/~van/papers/2016-TOG-deepRL/index.html 0 comments
- Cross-entropy method - Wikipedia https://en.wikipedia.org/wiki/Cross-entropy_method 0 comments
Related searches:
Search whole site: site:karpathy.github.io
Search title: Deep Reinforcement Learning: Pong from Pixels