Hacker News
- "Player of Games", Schmid et al 2021 {DM} (generalizing AlphaZero to imperfect-information games) https://arxiv.org/abs/2112.03178#deepmind 6 comments reinforcementlearning
- Deepmind’s latest paper : Player of Games https://arxiv.org/abs/2112.03178 8 comments futurology
Linking pages
- It Looks Like You’re Trying To Take Over The World · Gwern.net https://www.gwern.net/fiction/Clippy 33 comments
- GitHub - captn3m0/boardgame-research: List of research around modern boardgames. https://github.com/captn3m0/boardgame-research 6 comments
- Liar’s Dice by Self-Play. With Counterfactual Regret and Neural… | by Thomas Dybdahl Ahle | Towards Data Science https://medium.com/@lobais/lairs-dice-by-self-play-3bbed6addde0 3 comments
- DeepMind’s PoG Excels in Perfect and Imperfect Information Games, Advancing Research on General Algorithms for Arbitrary Environments | Synced https://syncedreview.com/2021/12/08/deepmind-podracer-tpu-based-rl-frameworks-deliver-exceptional-performance-at-low-cost-161/ 0 comments
- Why do LLMs use greedy sampling? - by Finbarr Timbers https://www.artfintel.com/p/why-do-llms-use-greedy-sampling 0 comments
- GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2112.03178] Player of Games
See how to search.