- Confusion of hyperparameters in ppo https://arxiv.org/abs/1707.06347 3 comments reinforcementlearning
Linking pages
- Competitive Self-Play https://blog.openai.com/competitive-self-play/ 138 comments
- MLGO: A Machine Learning Framework for Compiler Optimization – Google AI Blog http://ai.googleblog.com/2022/07/mlgo-machine-learning-framework-for.html 81 comments
- Reinforcement Learning with Prediction-Based Rewards https://blog.openai.com/reinforcement-learning-with-prediction-based-rewards/ 38 comments
- GitHub - andri27-ts/Reinforcement-Learning: Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning https://github.com/andri27-ts/60_Days_RL_Challenge 22 comments
- GitHub - google-research/seed_rl: SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture. https://github.com/google-research/seed_rl 20 comments
- GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
- GitHub - lcswillems/torch-ac: Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO https://github.com/lcswillems/torch-ac 15 comments
- Speeding Up Reinforcement Learning with a New Physics Simulation Engine – Google AI Blog https://ai.googleblog.com/2021/07/speeding-up-reinforcement-learning-with.html 13 comments
- Introducing SafeLife: Safety Benchmarks for Reinforcement Learning - Partnership on AI https://www.partnershiponai.org/safelife 12 comments
- baselines/baselines/ppo2 at master · openai/baselines · GitHub https://github.com/openai/baselines/tree/master/baselines/ppo2 12 comments
- GitHub - marload/DeepRL-TensorFlow2: 🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2 https://github.com/marload/deep-rl-tf2 10 comments
- RAdam: A New State-of-the-Art Optimizer for RL? | by Chris Nota | Autonomous Learning Library | Medium https://medium.com/autonomous-learning-library/radam-a-new-state-of-the-art-optimizer-for-rl-442c1e830564 10 comments
- GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning https://github.com/keiohta/tf2rl 9 comments
- The 32 Implementation Details of Proximal Policy Optimization (PPO) Algorithm https://costa.sh/blog-the-32-implementation-details-of-ppo.html 9 comments
- Ingredients for Robotics Research https://openai.com/blog/ingredients-for-robotics-research/ 8 comments
- GitHub - chaovven/PyRL: PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) https://github.com/chaovven/PyRL 8 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail 7 comments
- OpenAI Baselines: ACKTR & A2C https://blog.openai.com/baselines-acktr-a2c/ 6 comments
- Introducing Huskarl: The Modular Deep Reinforcement Learning Framework | by TensorFlow | Medium https://medium.com/@tensorflow/introducing-huskarl-the-modular-deep-reinforcement-learning-framework-e47d4b228dd3 6 comments
- Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
Related searches:
Search whole site: site:arxiv.org
Search title: Confusion of hyperparameters in ppo
See how to search.