Hacker News
- Avoiding fusion plasma tearing instability with deep reinforcement learning https://www.nature.com/articles/s41586-024-07024-9 124 comments
- Algorithms for Reinforcement Learning https://sites.ualberta.ca/~szepesva/RLBook.html 19 comments
- Reinforcement Learning: An Introduction (2018) [pdf] http://incompleteideas.net/book/RLbook2018.pdf 23 comments
- Dopamine framework: Fast prototyping of reinforcement learning algorithms (2018) https://ai.googleblog.com/2018/08/introducing-new-framework-for-flexible.html 15 comments
- Practical RL – A course in reinforcement learning in the wild https://github.com/yandexdataschool/Practical_RL 4 comments
- PlaNet: A Deep Planning Network for Reinforcement Learning https://ai.googleblog.com/2019/02/introducing-planet-deep-planning.html 3 comments
- Offline reinforcement learning - 10x faster than SOTA with evolutionary HPO https://github.com/AgileRL/AgileRL 13 comments reinforcementlearning
- AMD uses AI (Reinforcement Learning) to optimize their graphics drivers None 3 comments linux_gaming
- [P] 10x faster reinforcement learning HPO - now with CNNs! https://github.com/AgileRL/AgileRL 28 comments machinelearning
- _Distributional Reinforcement Learning_, Bellemare et al 2021 {DM} (draft book) https://www.distributional-rl.org/ 4 comments reinforcementlearning
- Reinforcement Learning At Facebook https://corecursive.com/061-reinforcement-learning/ 18 comments programming
- Simulating SQL Injection Exploitation Using Reinforcement Learning https://portswigger.net/daily-swig/machine-learning-offers-fresh-approach-to-tackling-sql-injection-vulnerabilities 5 comments netsec
- Reinforcement learning’s foundational flaw https://thegradient.pub/why-rl-is-flawed/ 6 comments artificial
- "Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning", De Asis et al 2019 https://arxiv.org/abs/1909.03906 9 comments reinforcementlearning
- Deep Reinforcement Learning for Factorio https://www.reddit.com/r/factorio/comments/c7zay3/deep_reinforcement_learning_for_factorio/ 30 comments factorio
- Evolving Rewards to Automate Reinforcement Learning https://arxiv.org/abs/1905.07628 3 comments reinforcementlearning
- Shared Autonomy via Deep Reinforcement Learning https://arxiv.org/abs/1802.01744 5 comments reinforcementlearning
- How to fix reinforcement learning https://thegradient.pub/how-to-fix-rl/ 3 comments reinforcementlearning
- "Imagination-Augmented Agents for Deep Reinforcement Learning", Weber et al 2017 {DM} https://arxiv.org/abs/1707.06203 3 comments reinforcementlearning
- Beating the World’s Best at Super Smash Bros. Melee with Deep Reinforcement Learning https://arxiv.org/pdf/1702.06230.pdf 3 comments artificial
- Introduction to Making a Simple Game AI with Deep Reinforcement Learning https://keon.io/rl/deep-q-learning-with-keras-and-gym/ 17 comments gamedev
- Intro to Deep Reinforcement Learning http://www.nervanasys.com/demystifying-deep-reinforcement-learning/ 4 comments programming
- Can reinforcement learning learn itself? A reply to 'Reward is enough' (PDF) https://philpapers.org/archive/ALECRL.pdf 5 comments reinforcementlearning
- Reinforcement Learning: An Introduction, 2nd edition by Richard S. Sutton and Andrew G. Barto (free pdf) [examples in common lisp] http://www.incompleteideas.net/book/the-book-2nd.html 5 comments lisp
- [D] How to compute the probability of trajectories term in Stochastic Gradient Meta Reinforcement Learning https://stats.stackexchange.com/questions/568495/how-to-compute-the-probability-of-trajectories-term-in-stochastic-gradient-meta 4 comments machinelearning
- Looking partners for Reinforcement learning course. https://web.mit.edu/dimitrib/www/RLbook.html 5 comments reinforcementlearning
- [P] Reinforcement Learning with Pytorch - Free course https://www.udemy.com/reinforcement-learning-with-pytorch/?couponCode=AI-PROMO-REDDIT 11 comments reinforcementlearning
- [P] Playing Atari with deep reinforcement learning - our approach https://deepsense.ai/playing-atari-with-deep-reinforcement-learning-deepsense-ais-approach/ 3 comments reinforcementlearning
- Teaching a Catapult to Shoot Down a Missile: First impressions with Unity's new reinforcement learning SDK http://adamashwal.com/catapult 5 comments gamedev
- "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem", Jiang et al 2017 https://arxiv.org/abs/1706.10059 5 comments reinforcementlearning
- Practical PyTorch: GridWorld with Reinforcement Learning (Policy Gradients with REINFORCE / Actor-Critic) https://github.com/spro/practical-pytorch/blob/master/reinforce-gridworld/reinforce-gridworld.ipynb 3 comments learnmachinelearning
- markovjs - Reinforcement Learning in Javascript [little lib I just coded] https://github.com/lsunsi/markovjs 3 comments webdev
- Inside DeepMind - upcoming Nature paper on "Human-level control through deep reinforcement learning" in Atari games http://www.33rdsquare.com/2015/02/inside-deep-mind.html 7 comments artificial
- Spinning Up in Deep RL - "...an educational resource produced by OpenAI that makes it easier to learn about deep reinforcement learning (deep RL)." https://blog.openai.com/spinning-up-in-deep-rl/ 3 comments learnmachinelearning
- [R] The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning https://arxiv.org/abs/2108.02755 3 comments machinelearning
- US Army Researchers Develop A New Framework For Collaborative Multi-Agent Reinforcement Learning Systems https://www.spiedigitallibrary.org/conference-proceedings-of-spie/11746/2585808/Survey-of-recent-multi-agent-reinforcement-learning-algorithms-utilizing-centralized/10.1117/12.2585808.short?SSO=1&tab=ArticleLinkCited 3 comments reinforcementlearning
- AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning https://deepmind.com/blog/article/alphastar-grandmaster-level-in-starcraft-ii-using-multi-agent-reinforcement-learning 94 comments programming
- Researchers From Princeton And Max Planck Developed A Reinforcement Learning–Based Simulation That Shows The Human Desire Always To Want More May Have Evolved As A Way To Speed Up Learning https://www.marktechpost.com/2022/08/06/researchers-from-princeton-and-max-planck-developed-a-reinforcement-learning-based-simulation-that-shows-the-human-desire-always-to-want-more-may-have-evolved-as-a-way-to-speed-up-learning/ 2 comments machinelearningnews
- Hi, I need a little bit of advice. I would like to implement a reinforcement learning agent for solving the „Job Shop Problem“. Is here anybody who can tell me how I should start implementing the agent. Is q learning suitable for the problem? https://developers.google.com/optimization/scheduling/job_shop#output 6 comments reinforcementlearning
- An artificial intelligence program called AlphaStar now ranks among the top 0.2% of human players for the strategy game StarCraft II. Reported in Nature this week, the algorithm represents a major achievement for machine learning, multi-agent reinforcement learning https://www.nature.com/articles/s41586-019-1724-z.epdf?shared_access_token=eeinxbhkk8z48e6x6fhzvdrgn0jajwel9jnr3zotv0pszcpzjfgnazholk4debckav0uumyg1zcvyjtjgsnl-x-42q3c4krjbwlioqpxrjaik4lbpapbj-nfrj4lklrar9u1vpqf2aprrhsogwhs1w%3D%3D 41 comments science