Hacker News
- Reinforcement Learning: From Zero to State of the Art with Pytorch 4 https://github.com/higgsfield/RL-Adventure-2 14 comments
- RL-Adventure-2: PyTorch implementations of A2C/GAE/PPO/ACER/DDPG/TDDDPG/GAIL/HER with Cartpole demos [Dulat Yerzat] https://github.com/higgsfield/RL-Adventure-2 6 comments reinforcementlearning
Linking pages
- reinforcement_learning.md · GitHub https://gist.github.com/mateuspontesm/5132df449875125af32412e5c4e73215 14 comments
- Soft Actor-Critic Demystified. An intuitive explanation of the theory… | by Vaishak V.Kumar | Towards Data Science https://towardsdatascience.com/soft-actor-critic-demystified-b8427df61665 12 comments
- GitHub - bharathgs/Awesome-pytorch-list: A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc. https://github.com/bharathgs/Awesome-pytorch-list 0 comments
Linked pages
- http://rll.berkeley.edu/deeprlcourse/ 18 comments
- [1509.02971] Continuous control with deep reinforcement learning https://arxiv.org/abs/1509.02971 15 comments
- [1707.01495] Hindsight Experience Replay https://arxiv.org/abs/1707.01495 11 comments
- https://arxiv.org/abs/1602.01783 7 comments
- OpenAI Baselines: ACKTR & A2C https://blog.openai.com/baselines-acktr-a2c/ 6 comments
- Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
- GitHub - yandexdataschool/Practical_RL: A course in reinforcement learning in the wild https://github.com/yandexdataschool/Practical_RL 4 comments
- [1611.01224] Sample Efficient Actor-Critic with Experience Replay https://arxiv.org/abs/1611.01224 4 comments
- RL-Adventure-2/3.ppo.ipynb at master · higgsfield/RL-Adventure-2 · GitHub https://github.com/higgsfield/RL-Adventure-2/blob/master/3.ppo.ipynb 4 comments
- [1801.01290] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor https://arxiv.org/abs/1801.01290 4 comments
- https://arxiv.org/abs/1707.06347 3 comments
- [1506.02438] High-Dimensional Continuous Control Using Generalized Advantage Estimation https://arxiv.org/abs/1506.02438 3 comments
- Deep RL Bootcamp - Lectures https://sites.google.com/view/deep-rl-bootcamp/lectures 0 comments
- http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html 0 comments
- Ingredients for Robotics Research https://blog.openai.com/ingredients-for-robotics-research/ 0 comments
- GitHub - higgsfield/RL-Adventure: Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL https://github.com/higgsfield/RL-Adventure 0 comments