Linking pages
- janissary https://blog.janissary.xyz/posts/hiring-functional-programming 147 comments
- Modern SAT solvers: fast, neat and underused (part 3 of N) — The Coding Nest https://codingnest.com/modern-sat-solvers-fast-neat-and-underused-part-3-of-n/ 128 comments
- Deep Reinforcement Learning Doesn't Work Yet https://www.alexirpan.com/2018/02/14/rl-hard.html 90 comments
- Evolution as Backstop for Reinforcement Learning · Gwern.net https://www.gwern.net/Backstop#pain-is-the-only-school-teacher 61 comments
- Why Tool AIs Want to Be Agent AIs · Gwern.net http://www.gwern.net/Tool%20AI 58 comments
- Think more about what to focus on - by Henrik Karlsson https://www.henrikkarlsson.xyz/p/multi-armed-bandit 55 comments
- Why Multi-armed Bandit algorithms are superior to A/B testing - Chris Stucchio http://www.chrisstucchio.com/blog/2012/bandit_algorithms_vs_ab.html 50 comments
- A decade later, Reddit’s comment sorting still fails to do its job – Single Lunch https://www.singlelunch.com/2019/09/17/a-decade-later-reddits-comment-sorting-still-fails-to-do-its-job/ 46 comments
- A/B testing long-form readability on Gwern.net · Gwern.net http://www.gwern.net/AB%20testing#training-a-neural-net-to-generate-css 39 comments
- Counterfactual Regret Minimization or How I won any money in Poker? | Nikhil. R https://rnikhil.com/2023/12/31/ai-cfr-solver-poker.html 26 comments
- Reinforcement Learning from scratch | by Emmanuel Ameisen | Insight https://blog.insightdatascience.com/reinforcement-learning-from-scratch-819b65f074d8 25 comments
- Why Companies Write Terrible Job Posts | Alya's Blog http://alyaabbott.wordpress.com/2014/09/22/why-companies-write-terrible-job-posts/ 20 comments
- Timing Technology: Lessons From The Media Lab · Gwern.net https://www.gwern.net/Timing 16 comments
- Playing to Win With AI: Is GPT-3 Too Easy? https://scottstevenson.substack.com/p/playing-to-win-with-ai-is-gpt-3-too 9 comments
- Are Sunk Costs Fallacies? · Gwern.net https://www.gwern.net/Sunk-cost 8 comments
- GitHub - rougier/ML-Recipes: A collection of stand-alone Python machine learning recipes https://github.com/rougier/ML-Recipes 8 comments
- Self-Optimizing A/B Tests | chanind.github.io https://chanind.github.io/2021/10/11/self-optimizing-ab-tests.html 7 comments
- Python with a Dash of C++: Optimizing Recommendation Serving | AI Logs https://ai.ragv.in/posts/python-with-a-dash-of-cpp-optimizing/ 6 comments
- When Should I Check The Mail? · Gwern.net http://www.gwern.net/Mail-delivery 6 comments
- Three activist tools we need. Post Trump, there is a ton of energy in… | by holmesworcester | Medium https://medium.com/@holmesworcester/three-activist-tools-we-need-86346eb6762c 5 comments