Proximal Policy Optimization — Spinning Up documentation - discu.eu

Reddit

Tricks and adaptions for PPO https://spinningup.openai.com/en/latest/algorithms/ppo.html 4 comments 15/9/2019 reinforcementlearning

PPO with continuous actions https://spinningup.openai.com/en/latest/algorithms/ppo.html 4 comments 7/1/2019 reinforcementlearning

Linking pages

The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2021/07/14/mappo/ 24 comments
Which Reinforcement learning-RL algorithm to use where, when and in what scenario? | by Ujwal Tewari | DataDrivenInvestor https://medium.com/datadriveninvestor/which-reinforcement-learning-rl-algorithm-to-use-where-when-and-in-what-scenario-e3e7617fb0b1?amp%3Bsk=ab3658c27431dafc50a276a8b166ba1d&source=friends_link 19 comments
GitHub - ericyangyu/PPO-for-Beginners: A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8. https://github.com/ericyangyu/PPO-for-Beginners/tree/master 11 comments
Training an AI for the card game Dominion - Ian W. Davis https://ianwdavis.com/dominion2.html 0 comments
minRLHF: Reinforcement Learning from Human Feedback from Scratch | Tom Tumiel https://ttumiel.com/blog/min-rlhf/ 0 comments
Reviewing Post-Training Techniques from Recent Open LLMs | Brian Fitzgerald https://brianfitzgerald.xyz/dpo-review/ 0 comments

Related searches:

Search whole site: site:spinningup.openai.com

Search title: Proximal Policy Optimization — Spinning Up documentation

See how to search.

Submit link to: