Q-learning - Wikipedia - discu.eu

Hacker News

Q-Learning https://en.wikipedia.org/wiki/Q-learning 33 comments 13/8/2019

Reddit

What's the deal with gamma? When should you want it to not be ~0.9? How does this change with batching? https://en.wikipedia.org/wiki/Q-learning#Discount_factor 5 comments 27/6/2019 reinforcementlearning

Linking pages

Top-down approach to Machine Learning (Updated 2019) https://www.zeroequalsfalse.press/2017/08/10/ml/ 48 comments
A/B testing long-form readability on Gwern.net · Gwern.net http://www.gwern.net/AB%20testing#training-a-neural-net-to-generate-css 39 comments
Reinforcement Learning from scratch | by Emmanuel Ameisen | Insight https://blog.insightdatascience.com/reinforcement-learning-from-scratch-819b65f074d8 25 comments
Solving Path of Exile item crafting with Reinforcement Learning · Denny's Blog https://dennybritz.com/posts/poe-crafting/ 19 comments
How To Think Real Good | Meta-rationality https://meaningness.com/metablog/how-to-think 9 comments
Why we need general AI and why we're not there yet - The Data Scientist https://thedatascientist.com/general-ai/ 6 comments
Tackling Open Challenges in Offline Reinforcement Learning – Google AI Blog https://ai.googleblog.com/2020/08/tackling-open-challenges-in-offline.html 6 comments
Training Generalist Agents with Multi-Game Decision Transformers – Google AI Blog https://ai.googleblog.com/2022/07/training-generalist-agents-with-multi.html 6 comments
Top-down approach to Machine Learning (Updated 2019) https://zeroequalsfalse.press/posts/machine-learning-introduction/ 5 comments
Demystifying deep reinforcement learning – TechTalks https://bdtechtalks.com/2021/09/02/deep-reinforcement-learning-explainer/amp 4 comments
ai-playground/analog at master · sy2002/ai-playground · GitHub https://github.com/sy2002/ai-playground/tree/master/analog 4 comments
Scalable Deep Reinforcement Learning for Robotic Manipulation – Google AI Blog https://ai.googleblog.com/2018/06/scalable-deep-reinforcement-learning.html 3 comments
GitHub - lsunsi/markovjs: Reinforcement Learning in JavaScript https://github.com/lsunsi/markovjs 3 comments
GitHub - chncyhn/flappybird-qlearning-bot: Flappy Bird Bot using Reinforcement Learning https://github.com/chncyhn/flappybird-qlearning-bot 2 comments
Scaling Multi-Agent Reinforcement Learning – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2018/12/12/rllib/ 2 comments
Season Of KDE 2025 Conclusion - KDE Mentorship https://mentorship.kde.org/blog/2025-05-04-sok-conclusion/ 2 comments
An Open Source Tool for Scaling Multi-Agent Reinforcement Learning - RISE Lab https://rise.cs.berkeley.edu/blog/scaling-multi-agent-rl-with-rllib/ 1 comment
From 30 to 11 Lines of Code: Rock Paper Scissors in Python | by David Amos | Better Programming https://betterprogramming.pub/from-30-to-11-lines-of-code-rock-paper-scissors-in-python-5bfa4313a8a7 1 comment
Drifting Efficiently Through the Stratosphere Using Deep Reinforcement Learning | by Salvatore Candido | X, the moonshot factory https://medium.com/loon-for-all/drifting-efficiently-through-the-stratosphere-using-deep-reinforcement-learning-c38723ee2e90 1 comment
These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project | WIRED https://www.wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project/ 1 comment

Related searches:

Search whole site: site:wikipedia.org

Search title: Q-learning - Wikipedia

See how to search.

Submit link to: