- Why does EfficientZero use SimSiam for temporal consistency instead of MAE / MSE? https://arxiv.org/abs/2111.00210 8 comments reinforcementlearning
- "EfficientZero: Mastering Atari Games with Limited Data", Ye et al 2021 (beating humans on ALE-100k/2h by adding self-supervised learning to MuZero-Reanalyze) https://arxiv.org/abs/2111.00210 13 comments reinforcementlearning
Linking pages
- Playing Chess With A Generalized AI | by Ben Bellerose | Towards Data Science https://medium.com/@bellerb/playing-chess-with-a-generalized-ai-b83d64ac71fe 9 comments
- Playing Chess With A Generalized AI | by Ben Bellerose | Towards Data Science https://towardsdatascience.com/playing-chess-with-a-generalized-ai-b83d64ac71fe 1 comment
- Gradient Update #13: FB Shuts Down Facial Recognition, MuZero upgraded to EfficientZero https://thegradientpub.substack.com/p/gradient-update-13-fb-shuts-down?justPublished=true 0 comments
- GitHub - opendilab/LightZero: LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit. https://github.com/opendilab/LightZero 0 comments
- GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2111.00210] Mastering Atari Games with Limited Data
See how to search.