Hacker News
- What's the deal with gamma? When should you want it to not be ~0.9? How does this change with batching? https://en.wikipedia.org/wiki/Q-learning#Discount_factor 5 comments reinforcementlearning
Linking pages
- Top-down approach to Machine Learning (Updated 2019) https://www.zeroequalsfalse.press/2017/08/10/ml/ 48 comments
- A/B testing long-form readability on Gwern.net · Gwern.net http://www.gwern.net/AB%20testing#training-a-neural-net-to-generate-css 39 comments
- Reinforcement Learning from scratch | by Emmanuel Ameisen | Insight https://blog.insightdatascience.com/reinforcement-learning-from-scratch-819b65f074d8 25 comments
- Solving Path of Exile item crafting with Reinforcement Learning · Denny's Blog https://dennybritz.com/posts/poe-crafting/ 19 comments
- How To Think Real Good | Meta-rationality https://meaningness.com/metablog/how-to-think 9 comments
- Why we need general AI and why we're not there yet - The Data Scientist https://thedatascientist.com/general-ai/ 6 comments
- Tackling Open Challenges in Offline Reinforcement Learning – Google AI Blog https://ai.googleblog.com/2020/08/tackling-open-challenges-in-offline.html 6 comments
- Training Generalist Agents with Multi-Game Decision Transformers – Google AI Blog https://ai.googleblog.com/2022/07/training-generalist-agents-with-multi.html 6 comments
- Top-down approach to Machine Learning (Updated 2019) https://zeroequalsfalse.press/posts/machine-learning-introduction/ 5 comments
- Demystifying deep reinforcement learning – TechTalks https://bdtechtalks.com/2021/09/02/deep-reinforcement-learning-explainer/amp 4 comments
- ai-playground/analog at master · sy2002/ai-playground · GitHub https://github.com/sy2002/ai-playground/tree/master/analog 4 comments
- Scalable Deep Reinforcement Learning for Robotic Manipulation – Google AI Blog https://ai.googleblog.com/2018/06/scalable-deep-reinforcement-learning.html 3 comments
- GitHub - lsunsi/markovjs: Reinforcement Learning in JavaScript https://github.com/lsunsi/markovjs 3 comments
- GitHub - chncyhn/flappybird-qlearning-bot: Flappy Bird Bot using Reinforcement Learning https://github.com/chncyhn/flappybird-qlearning-bot 2 comments
- Scaling Multi-Agent Reinforcement Learning – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2018/12/12/rllib/ 2 comments
- An Open Source Tool for Scaling Multi-Agent Reinforcement Learning - RISE Lab https://rise.cs.berkeley.edu/blog/scaling-multi-agent-rl-with-rllib/ 1 comment
- From 30 to 11 Lines of Code: Rock Paper Scissors in Python | by David Amos | Better Programming https://betterprogramming.pub/from-30-to-11-lines-of-code-rock-paper-scissors-in-python-5bfa4313a8a7 1 comment
- Drifting Efficiently Through the Stratosphere Using Deep Reinforcement Learning | by Salvatore Candido | X, the moonshot factory https://medium.com/loon-for-all/drifting-efficiently-through-the-stratosphere-using-deep-reinforcement-learning-c38723ee2e90 1 comment
- These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project | WIRED https://www.wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project/ 1 comment
- GitHub - the5avage/Q-Learning: Q-Learning for temperature control https://github.com/the5avage/Q-Learning 1 comment
Related searches:
Search whole site: site:en.wikipedia.org
Search title: Q-learning - Wikipedia
See how to search.