What is reinforcement learning

AMD uses AI (Reinforcement Learning) to optimize their graphics drivers None 3 comments 20/4/2023 linux_gaming

_Distributional Reinforcement Learning_, Bellemare et al 2021 {DM} (draft book) https://www.distributional-rl.org/ 4 comments 16/12/2021 reinforcementlearning

Reinforcement Learning At Facebook https://corecursive.com/061-reinforcement-learning/ 18 comments 4/2/2021 programming

Simulating SQL Injection Exploitation Using Reinforcement Learning https://portswigger.net/daily-swig/machine-learning-offers-fresh-approach-to-tackling-sql-injection-vulnerabilities 5 comments 30/1/2021 netsec

"Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning", De Asis et al 2019 https://arxiv.org/abs/1909.03906 9 comments 28/9/2019 reinforcementlearning

Deep Reinforcement Learning for Factorio https://www.reddit.com/r/factorio/comments/c7zay3/deep_reinforcement_learning_for_factorio/ 30 comments 1/7/2019 factorio

Evolving Rewards to Automate Reinforcement Learning https://arxiv.org/abs/1905.07628 3 comments 21/5/2019 reinforcementlearning

Shared Autonomy via Deep Reinforcement Learning https://arxiv.org/abs/1802.01744 5 comments 26/2/2019 reinforcementlearning

How to fix reinforcement learning https://thegradient.pub/how-to-fix-rl/ 3 comments 9/7/2018 reinforcementlearning

Bioloid walking after reinforcement learning - version 3 https://www.youtube.com/watch?amp%3Bv=O2rx4Bdwn24&time_continue=52 13 comments 2/2/2018 robotics

"Imagination-Augmented Agents for Deep Reinforcement Learning", Weber et al 2017 {DM} https://arxiv.org/abs/1707.06203 3 comments 21/7/2017 reinforcementlearning

Beating the World’s Best at Super Smash Bros. Melee with Deep Reinforcement Learning https://arxiv.org/pdf/1702.06230.pdf 3 comments 1/3/2017 artificial

Introduction to Making a Simple Game AI with Deep Reinforcement Learning https://keon.io/rl/deep-q-learning-with-keras-and-gym/ 17 comments 6/2/2017 gamedev

Intro to Deep Reinforcement Learning http://www.nervanasys.com/demystifying-deep-reinforcement-learning/ 4 comments 26/3/2016 programming

[P] Offline reinforcement learning - 10x faster than SOTA with evolutionary HPO https://github.com/AgileRL/AgileRL 10 comments 24/5/2023 machinelearning

[P] Reinforcement learning evolutionary hyperparameter optimization - 10x speed up https://github.com/AgileRL/AgileRL 26 comments 24/3/2023 machinelearning

Can reinforcement learning learn itself? A reply to 'Reward is enough' (PDF) https://philpapers.org/archive/ALECRL.pdf 5 comments 21/4/2022 reinforcementlearning

Reinforcement Learning: An Introduction, 2nd edition by Richard S. Sutton and Andrew G. Barto (free pdf) [examples in common lisp] http://www.incompleteideas.net/book/the-book-2nd.html 5 comments 9/4/2022 lisp

[D] How to compute the probability of trajectories term in Stochastic Gradient Meta Reinforcement Learning https://stats.stackexchange.com/questions/568495/how-to-compute-the-probability-of-trajectories-term-in-stochastic-gradient-meta 4 comments 21/3/2022 machinelearning

A.I. Learns - Walk or Run in the Rain? (150.000+ trials) - Reinforcement Learning + Unity https://www.youtube.com/watch?v=V_GidhS-hzo 11 comments 22/3/2020 learnmachinelearning

[P] Reinforcement Learning with Pytorch - Free course https://www.udemy.com/reinforcement-learning-with-pytorch/?couponCode=AI-PROMO-REDDIT 11 comments 20/6/2018 reinforcementlearning

[P] Playing Atari with deep reinforcement learning - our approach https://deepsense.ai/playing-atari-with-deep-reinforcement-learning-deepsense-ais-approach/ 3 comments 19/6/2018 reinforcementlearning

"A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem", Jiang et al 2017 https://arxiv.org/abs/1706.10059 5 comments 5/7/2017 reinforcementlearning

Practical PyTorch: GridWorld with Reinforcement Learning (Policy Gradients with REINFORCE / Actor-Critic) https://github.com/spro/practical-pytorch/blob/master/reinforce-gridworld/reinforce-gridworld.ipynb 3 comments 18/6/2017 learnmachinelearning

Inside DeepMind - upcoming Nature paper on "Human-level control through deep reinforcement learning" in Atari games http://www.33rdsquare.com/2015/02/inside-deep-mind.html 7 comments 25/2/2015 artificial

[M] AI learns to play SNAKE using Reinforcement Learning Part 2 https://youtu.be/WjuLQVg04JY 5 comments 9/8/2019 robotics

Spinning Up in Deep RL - "...an educational resource produced by OpenAI that makes it easier to learn about deep reinforcement learning (deep RL)." https://blog.openai.com/spinning-up-in-deep-rl/ 3 comments 9/11/2018 learnmachinelearning

[R] The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning https://arxiv.org/abs/2108.02755 3 comments 8/8/2021 machinelearning

US Army Researchers Develop A New Framework For Collaborative Multi-Agent Reinforcement Learning Systems https://www.spiedigitallibrary.org/conference-proceedings-of-spie/11746/2585808/Survey-of-recent-multi-agent-reinforcement-learning-algorithms-utilizing-centralized/10.1117/12.2585808.short?SSO=1&tab=ArticleLinkCited 3 comments 22/6/2021 reinforcementlearning

AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning https://deepmind.com/blog/article/alphastar-grandmaster-level-in-starcraft-ii-using-multi-agent-reinforcement-learning 94 comments 31/10/2019 programming

Researchers From Princeton And Max Planck Developed A Reinforcement Learning–Based Simulation That Shows The Human Desire Always To Want More May Have Evolved As A Way To Speed Up Learning https://www.marktechpost.com/2022/08/06/researchers-from-princeton-and-max-planck-developed-a-reinforcement-learning-based-simulation-that-shows-the-human-desire-always-to-want-more-may-have-evolved-as-a-way-to-speed-up-learning/ 2 comments 7/8/2022 machinelearningnews

[Project] Library for offline model-based reinforcement learning https://github.com/Mr-Pepe/offline-model-based-rl 2 comments 3/10/2022 machinelearning

Hi, I need a little bit of advice. I would like to implement a reinforcement learning agent for solving the „Job Shop Problem“. Is here anybody who can tell me how I should start implementing the agent. Is q learning suitable for the problem? https://developers.google.com/optimization/scheduling/job_shop#output 6 comments 30/6/2021 reinforcementlearning