- Understanding PPO with Recurrent Policies https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail 7 comments reinforcementlearning
Linking pages
- GitHub - wwxFromTju/awesome-reinforcement-learning-lib: GitHub's code repository is all you need https://github.com/wwxFromTju/awesome-reinforcement-learning-lib 19 comments
- GitHub - marlbenchmark/on-policy: This is the official implementation of Multi-Agent PPO (MAPPO). https://github.com/marlbenchmark/on-policy 7 comments
- Battlesnake Post Mortem. Using a desktop GPU to top the global… | by Cory Binnersley | Asymptotic Labs | Medium https://medium.com/asymptoticlabs/battlesnake-post-mortem-a5917f9a3428 1 comment
- GitHub - SforAiDl/genrl: A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL https://github.com/SforAiDl/genrl 1 comment
- AllenAct https://allenact.org 0 comments
Linked pages
- PyTorch http://pytorch.org/ 100 comments
- GitHub - mgbellemare/Arcade-Learning-Environment: The Arcade Learning Environment (ALE) -- a platform for AI research. https://github.com/mgbellemare/Arcade-Learning-Environment 21 comments
- OpenAI Gym https://gym.openai.com/ 18 comments
- [1812.05905] Soft Actor-Critic Algorithms and Applications https://arxiv.org/abs/1812.05905 10 comments
- OpenAI Baselines: ACKTR & A2C https://blog.openai.com/baselines-acktr-a2c/ 6 comments
- Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
- MuJoCo — Advanced Physics Simulation http://www.mujoco.org/ 4 comments
- [1707.06347] Proximal Policy Optimization Algorithms https://arxiv.org/abs/1707.06347 3 comments
- [1709.06560] Deep Reinforcement Learning that Matters https://arxiv.org/abs/1709.06560 3 comments
- GitHub - deepmind/dm_control: DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo. https://github.com/deepmind/dm_control 0 comments
- [1801.00690] DeepMind Control Suite https://arxiv.org/abs/1801.00690 0 comments