[1611.01224] Sample Efficient Actor-Critic with Experience Replay - discu.eu

Reddit

What's best rl algorithm for gpu utilization, for continuous action space? Can ACER be updated in synchronously to achieve better gpu utilization. https://arxiv.org/abs/1611.01224 4 comments 28/10/2019 reinforcementlearning

Linking pages

Introducing Google Research Football: A Novel Reinforcement Learning Environment – Google AI Blog https://ai.googleblog.com/2019/06/introducing-google-research-football.html 29 comments
GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
Understanding why there isn't a log probability in TRPO and PPO's objective https://costa.sh/blog-understanding-why-there-isn't-a-log-probability-in-trpo-and-ppo's-objective.html 10 comments
GitHub - chaovven/PyRL: PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) https://github.com/chaovven/PyRL 8 comments
OpenAI Baselines: ACKTR & A2C https://blog.openai.com/baselines-acktr-a2c/ 6 comments
Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
GitHub - Kaixhin/ACER: Actor-critic with experience replay https://github.com/Kaixhin/ACER/tree/master 1 comment
Preferred Networks’ ChainerRL Joins PyTorch Ecosystem as ‘PFRL’ | Synced https://syncedreview.com/2020/10/22/preferred-networks-chainerrl-joins-pytorch-ecosystem-as-pfrl/ 0 comments

Related searches:

Search whole site: site:arxiv.org

Search title: [1611.01224] Sample Efficient Actor-Critic with Experience Replay

See how to search.

Submit link to: