- Old policy and new policy in PPO https://github.com/keiohta/tf2rl 9 comments reinforcementlearning
Linked pages
- TensorFlow http://tensorflow.org/ 440 comments
- [1511.05952] Prioritized Experience Replay https://arxiv.org/abs/1511.05952 19 comments
- [1509.02971] Continuous control with deep reinforcement learning https://arxiv.org/abs/1509.02971 15 comments
- argparse â Parser for command-line options, arguments and sub-commands — Python 3.12.4 documentation https://docs.python.org/3/library/argparse.html#argparse.ArgumentParser.add_argument_group 12 comments
- [1812.05905] Soft Actor-Critic Algorithms and Applications https://arxiv.org/abs/1812.05905 10 comments
- http://www.cs.toronto.edu/~vmnih/docs/dqn.pdf 10 comments
- [1801.01290] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor https://arxiv.org/abs/1801.01290 4 comments
- [1312.6114] Auto-Encoding Variational Bayes https://arxiv.org/abs/1312.6114 4 comments
- https://arxiv.org/abs/1707.06347 3 comments
- [1506.02438] High-Dimensional Continuous Control Using Generalized Advantage Estimation https://arxiv.org/abs/1506.02438 3 comments
- [1706.10295] Noisy Networks for Exploration https://arxiv.org/abs/1706.10295 0 comments
- [1511.06581] Dueling Network Architectures for Deep Reinforcement Learning http://arxiv.org/abs/1511.06581 0 comments
- [1707.06887] A Distributional Perspective on Reinforcement Learning https://arxiv.org/abs/1707.06887 0 comments
- [1509.06461] Deep Reinforcement Learning with Double Q-learning http://arxiv.org/abs/1509.06461 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - keiohta/tf2rl: TensorFlow2 Reinforcement Learning
See how to search.