- What's best rl algorithm for gpu utilization, for continuous action space? Can ACER be updated in synchronously to achieve better gpu utilization. https://arxiv.org/abs/1611.01224 4 comments reinforcementlearning
Linking pages
- Introducing Google Research Football: A Novel Reinforcement Learning Environment – Google AI Blog https://ai.googleblog.com/2019/06/introducing-google-research-football.html 29 comments
- GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
- Understanding why there isn't a log probability in TRPO and PPO's objective https://costa.sh/blog-understanding-why-there-isn't-a-log-probability-in-trpo-and-ppo's-objective.html 10 comments
- GitHub - chaovven/PyRL: PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) https://github.com/chaovven/PyRL 8 comments
- OpenAI Baselines: ACKTR & A2C https://blog.openai.com/baselines-acktr-a2c/ 6 comments
- Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
- GitHub - Kaixhin/ACER: Actor-critic with experience replay https://github.com/Kaixhin/ACER/tree/master 1 comment
- Preferred Networks’ ChainerRL Joins PyTorch Ecosystem as ‘PFRL’ | Synced https://syncedreview.com/2020/10/22/preferred-networks-chainerrl-joins-pytorch-ecosystem-as-pfrl/ 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1611.01224] Sample Efficient Actor-Critic with Experience Replay
See how to search.