Multi-armed bandit - Wikipedia

Linking pages

janissary https://blog.janissary.xyz/posts/hiring-functional-programming 147 comments
Modern SAT solvers: fast, neat and underused (part 3 of N) — The Coding Nest https://codingnest.com/modern-sat-solvers-fast-neat-and-underused-part-3-of-n/ 128 comments
Deep Reinforcement Learning Doesn't Work Yet https://www.alexirpan.com/2018/02/14/rl-hard.html 90 comments
Evolution as Backstop for Reinforcement Learning · Gwern.net https://www.gwern.net/Backstop#pain-is-the-only-school-teacher 61 comments
Why Tool AIs Want to Be Agent AIs · Gwern.net http://www.gwern.net/Tool%20AI 58 comments
Think more about what to focus on - by Henrik Karlsson https://www.henrikkarlsson.xyz/p/multi-armed-bandit 55 comments
Why Multi-armed Bandit algorithms are superior to A/B testing - Chris Stucchio http://www.chrisstucchio.com/blog/2012/bandit_algorithms_vs_ab.html 50 comments
A decade later, Reddit’s comment sorting still fails to do its job – Single Lunch https://www.singlelunch.com/2019/09/17/a-decade-later-reddits-comment-sorting-still-fails-to-do-its-job/ 46 comments
A/B testing long-form readability on Gwern.net · Gwern.net http://www.gwern.net/AB%20testing#training-a-neural-net-to-generate-css 39 comments
Counterfactual Regret Minimization or How I won any money in Poker? | Nikhil. R https://rnikhil.com/2023/12/31/ai-cfr-solver-poker.html 26 comments
Reinforcement Learning from scratch | by Emmanuel Ameisen | Insight https://blog.insightdatascience.com/reinforcement-learning-from-scratch-819b65f074d8 25 comments
Why Companies Write Terrible Job Posts | Alya's Blog http://alyaabbott.wordpress.com/2014/09/22/why-companies-write-terrible-job-posts/ 20 comments
Timing Technology: Lessons From The Media Lab · Gwern.net https://www.gwern.net/Timing 16 comments
Playing to Win With AI: Is GPT-3 Too Easy? https://scottstevenson.substack.com/p/playing-to-win-with-ai-is-gpt-3-too 9 comments
Fun with Timing Attacks | Robbie Ostrow https://ostro.ws/post-timing-attacks 9 comments
Are Sunk Costs Fallacies? · Gwern.net https://www.gwern.net/Sunk-cost 8 comments
GitHub - rougier/ML-Recipes: A collection of stand-alone Python machine learning recipes https://github.com/rougier/ML-Recipes 8 comments
Self-Optimizing A/B Tests | chanind.github.io https://chanind.github.io/2021/10/11/self-optimizing-ab-tests.html 7 comments
Python with a Dash of C++: Optimizing Recommendation Serving | AI Logs https://ai.ragv.in/posts/python-with-a-dash-of-cpp-optimizing/ 6 comments
When Should I Check The Mail? · Gwern.net http://www.gwern.net/Mail-delivery 6 comments