[1801.01290] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor - discu.eu

Reddit

Could someone explain to me the Policy update in the Soft-Actor Critic algorithm ? https://arxiv.org/abs/1801.01290 4 comments 12/12/2018 reinforcementlearning

Linking pages

GitHub - google-research/seed_rl: SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture. https://github.com/google-research/seed_rl 20 comments
GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
GitHub - rail-berkeley/softlearning: Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm. https://github.com/rail-berkeley/softlearning 14 comments
Soft Actor-Critic Demystified. An intuitive explanation of the theory… | by Vaishak V.Kumar | Towards Data Science https://towardsdatascience.com/soft-actor-critic-demystified-b8427df61665 12 comments
GitHub - haarnoja/sac: Soft Actor-Critic https://github.com/haarnoja/sac 11 comments
GitHub - marload/DeepRL-TensorFlow2: 🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2 https://github.com/marload/deep-rl-tf2 10 comments
GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning https://github.com/keiohta/tf2rl 9 comments
Ingredients for Robotics Research https://openai.com/blog/ingredients-for-robotics-research/ 8 comments
GitHub - trackmania-rl/tmrl: Reinforcement Learning for real-time applications - host of the TrackMania Roborace League https://github.com/trackmania-rl/tmrl 8 comments
GitHub - chaovven/PyRL: PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) https://github.com/chaovven/PyRL 8 comments
Introducing Huskarl: The Modular Deep Reinforcement Learning Framework | by TensorFlow | Medium https://medium.com/@tensorflow/introducing-huskarl-the-modular-deep-reinforcement-learning-framework-e47d4b228dd3 6 comments
GitHub - medipixel/rl_algorithms: Structural implementation of RL key algorithms https://github.com/medipixel/rl_algorithms 5 comments
GitHub - michaelnny/deep_rl_zoo: A collection of Deep RL algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar. https://github.com/michaelnny/deep_rl_zoo 2 comments
GitHub - the5avage/Q-Learning: Q-Learning for temperature control https://github.com/the5avage/Q-Learning 1 comment
Train a Deep Q Network with TF-Agents | TensorFlow Agents https://www.tensorflow.org/agents/tutorials/1_dqn_tutorial 0 comments
Learning to Influence Multi-Agent Interaction | SAIL Blog http://ai.stanford.edu/blog/lili/ 0 comments
GitHub - banditml/banditml: A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services. https://github.com/banditml/banditml 0 comments
Offline Reinforcement Learning: How Conservative Algorithms Can Enable New Applications – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2020/12/07/offline/ 0 comments
Ingredients for Robotics Research https://blog.openai.com/ingredients-for-robotics-research/ 0 comments
Top 8 trends from ICLR 2019 https://huyenchip.com/2019/05/12/top-8-trends-from-iclr-2019.html 0 comments

Related searches:

Search whole site: site:arxiv.org

Search title: [1801.01290] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

See how to search.

Submit link to: