Hacker News
Linking pages
- GitHub - higgsfield/RL-Adventure-2: PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay https://github.com/higgsfield/RL-Adventure-2 20 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail 7 comments
- GitHub - qfettes/DeepRL-Tutorials: Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch https://github.com/qfettes/DeepRL-Tutorials 4 comments
- Elon Musk's Research Venture Has Trained AI To Teach Itself https://futurism.com/elon-musks-research-venture-has-trained-ai-to-teach-itself/ 2 comments
- Berkeley Deep RL Bootcamp http://planspace.org/20170830-berkeley_deep_rl_bootcamp/ 0 comments
- GitHub - ikostrikov/pytorch-a2c-ppo-acktr-gail: PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). https://github.com/ikostrikov/pytorch-a2c-ppo-acktr 0 comments
Linked pages
- ChatGPT https://chat.openai.com/ 742 comments
- OpenAI Baselines: DQN https://blog.openai.com/openai-baselines-dqn/ 36 comments
- [1611.05763] Learning to reinforcement learn https://arxiv.org/abs/1611.05763 9 comments
- https://arxiv.org/abs/1602.01783 7 comments
- [1611.01224] Sample Efficient Actor-Critic with Experience Replay https://arxiv.org/abs/1611.01224 4 comments
- https://arxiv.org/abs/1707.06347 3 comments
- GitHub - openai/baselines: OpenAI Baselines: high-quality implementations of reinforcement learning algorithms https://github.com/openai/baselines 3 comments
- [1503.05671] Optimizing Neural Networks with Kronecker-factored Approximate Curvature https://arxiv.org/abs/1503.05671 3 comments
Related searches:
Search whole site: site:blog.openai.com
Search title: OpenAI Baselines: ACKTR & A2C
See how to search.