- Tricks and adaptions for PPO https://spinningup.openai.com/en/latest/algorithms/ppo.html 4 comments reinforcementlearning
- PPO with continuous actions https://spinningup.openai.com/en/latest/algorithms/ppo.html 4 comments reinforcementlearning
Linking pages
- The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2021/07/14/mappo/ 24 comments
- Which Reinforcement learning-RL algorithm to use where, when and in what scenario? | by Ujwal Tewari | DataDrivenInvestor https://medium.com/datadriveninvestor/which-reinforcement-learning-rl-algorithm-to-use-where-when-and-in-what-scenario-e3e7617fb0b1?amp%3Bsk=ab3658c27431dafc50a276a8b166ba1d&source=friends_link 19 comments
- GitHub - ericyangyu/PPO-for-Beginners: A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8. https://github.com/ericyangyu/PPO-for-Beginners/tree/master 11 comments
- Training an AI for the card game Dominion - Ian W. Davis https://ianwdavis.com/dominion2.html 0 comments
- minRLHF: Reinforcement Learning from Human Feedback from Scratch | Tom Tumiel https://ttumiel.com/blog/min-rlhf/ 0 comments