- Infinite Horizon problem with SAC and custom environment https://arxiv.org/abs/1812.05905 2 comments reinforcementlearning
- Automating Entropy Adjustment for Maximum Entropy RL https://arxiv.org/abs/1812.05905 8 comments reinforcementlearning
Linking pages
- ROBEL: Robotics Benchmarks for Learning with Low-Cost Robots – Google AI Blog https://ai.googleblog.com/2019/10/robel-robotics-benchmarks-for-learning.html 25 comments
- GitHub - rail-berkeley/softlearning: Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm. https://github.com/rail-berkeley/softlearning 14 comments
- Soft Actor-Critic Demystified. An intuitive explanation of the theory… | by Vaishak V.Kumar | Towards Data Science https://towardsdatascience.com/soft-actor-critic-demystified-b8427df61665 12 comments
- GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning https://github.com/keiohta/tf2rl 9 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail 7 comments
- Tackling Open Challenges in Offline Reinforcement Learning – Google AI Blog https://ai.googleblog.com/2020/08/tackling-open-challenges-in-offline.html 6 comments
- GitHub - tinker495/jax-baseline https://github.com/tinker495/jax-baseline 5 comments
- SAC minitaur with the Actor-Learner API | TensorFlow Agents https://www.tensorflow.org/agents/tutorials/7_SAC_minitaur_tutorial 4 comments
- Soft Actor CriticâDeep Reinforcement Learning with Real-World Robots – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2018/12/14/sac/ 4 comments
- GitHub - google/dopamine: Dopamine is a research framework for fast prototyping of reinforcement learning algorithms. https://github.com/google/dopamine 1 comment
- GitHub - thomashiemstra/fred: This my 3d printed robot arm project https://github.com/thomashiemstra/fred 0 comments
- Data-Driven Deep Reinforcement Learning – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2019/12/05/bear/ 0 comments
- Does On-Policy Data Collection Fix Errors in Off-Policy Reinforcement Learning? – The Berkeley Artificial Intelligence Research Blog http://bair.berkeley.edu/blog/2020/03/16/discor/ 0 comments
- Soft Actor-Critic: Deep Reinforcement Learning for Robotics – Google AI Blog http://ai.googleblog.com/2019/01/soft-actor-critic-deep-reinforcement.html 0 comments
- GitHub - tensorflow/agents: TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning. https://github.com/tensorflow/agents 0 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1812.05905] Soft Actor-Critic Algorithms and Applications
See how to search.