Hacker News
- Playing Atari with Deep Reinforcement Learning [pdf] http://www.cs.toronto.edu/~vmnih/docs/dqn.pdf 7 comments
Linking pages
- To Understand Language is to Understand Generalization | Eric Jang https://evjang.com/2021/12/17/lang-generalization.html 38 comments
- The Deep Mind of Demis Hassabis. Google’s prize AI prodigy tells all | by Steven Levy | Backchannel | Medium https://medium.com/backchannel/the-deep-mind-of-demis-hassabis-156112890d8a 13 comments
- Tuning Recurrent Neural Networks with Reinforcement Learning https://magenta.tensorflow.org/2016/11/09/tuning-recurrent-networks-with-reinforcement-learning/ 11 comments
- The Dependency Valley – Jared Porcenaluk http://www.jaredporcenaluk.com/the-dependency-valley/ 10 comments
- GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning https://github.com/keiohta/tf2rl 9 comments
- The 32 Implementation Details of Proximal Policy Optimization (PPO) Algorithm https://costa.sh/blog-the-32-implementation-details-of-ppo.html 9 comments
- multi-agent-rl/README.md at master · rohan-sawhney/multi-agent-rl · GitHub https://github.com/rohan-sawhney/multi-agent-rl/blob/master/README.md 5 comments
- How The Hell Do We Create General-Purpose Robots? https://howthehell.substack.com/p/general-purpose-robots 4 comments
- Creating Deep Neural Networks from Scratch, an Introduction to Reinforcement Learning | by Abhav Kedia | Towards Data Science https://towardsdatascience.com/creating-deep-neural-networks-from-scratch-an-introduction-to-reinforcement-learning-6bba874019db 4 comments
- Bayesian Deep Learning — While My MCMC Gently Samples http://twiecki.github.io/blog/2016/06/01/bayesian-deep-learning/ 3 comments
- Deep Reinforcement Learning: Playing a Racing Game - Byte Tank https://lopespm.github.io/machine_learning/2016/10/06/deep-reinforcement-learning-racing-game.html 1 comment
- GitHub - nileshsah/reinforcement-learning-flappybird: In-browser reinforcement learning for flappy bird 🐦 https://github.com/nileshsah/reinforcement-learning-flappybird 1 comment
- GitHub - the5avage/Q-Learning: Q-Learning for temperature control https://github.com/the5avage/Q-Learning 1 comment
- BYOBBO. Build your own black-box optimizer | by David Sweet | Medium https://medium.com/@davidsweet_85241/byobbo-a827ae0fafa 0 comments
- Applications of Reinforcement Learning in Real World | by Gary Chan | Towards Data Science https://towardsdatascience.com/applications-of-reinforcement-learning-in-real-world-1a94955bcd12 0 comments
- PyTorch Tutorials: Teaching AI How to Play Flappy Bird | Toptal https://www.toptal.com/deep-learning/pytorch-reinforcement-learning-tutorial 0 comments
- The Visual Task Adaptation Benchmark – Google AI Blog https://ai.googleblog.com/2019/11/the-visual-task-adaptation-benchmark.html 0 comments
- GitHub - banditml/banditml: A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services. https://github.com/banditml/banditml 0 comments
- I built a LinearRegression that can play Pong with me. | by Diego Aguado | HackerNoon.com | Medium https://medium.com/@diegoagher/i-built-a-linearregression-that-can-play-pong-with-me-7b00d73f3fcc 0 comments
- Introduction to Fiber - Fiber https://uber.github.io/fiber/introduction/ 0 comments