Reinforcement Learning with Prediction-Based Rewards - discu.eu

Reddit

[D] Evaluation policy for Q-learning agents with intrinsic reward https://openai.com/blog/reinforcement-learning-with-prediction-based-rewards/ 3 comments 22/3/2019 reinforcementlearning

Linked pages

ChatGPT https://chat.openai.com/ 756 comments
OpenAI Status https://status.openai.com 195 comments
Learning Montezuma's Revenge from a Single Demonstration https://blog.openai.com/learning-montezumas-revenge-from-a-single-demonstration/ 48 comments
GitHub - mgbellemare/Arcade-Learning-Environment: The Arcade Learning Environment (ALE) -- a platform for AI research. https://github.com/mgbellemare/Arcade-Learning-Environment 21 comments
GitHub - microsoft/malmo: Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presented by this unique environment. https://github.com/microsoft/malmo 18 comments
[1805.11592] Playing hard exploration games by watching YouTube https://arxiv.org/abs/1805.11592 11 comments
GitHub - Unity-Technologies/ml-agents: The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning. https://github.com/Unity-Technologies/ml-agents 5 comments
Proximal Policy Optimization https://blog.openai.com/openai-baselines-ppo/ 5 comments
Curiosity-driven Exploration by Self-supervised Prediction https://pathak22.github.io/noreward-rl/ 4 comments
[1805.11593] Observe and Look Further: Achieving Consistent Performance on Atari https://arxiv.org/abs/1805.11593 4 comments
[1707.06347] Proximal Policy Optimization Algorithms https://arxiv.org/abs/1707.06347 3 comments
Human-level control through deep reinforcement learning | Nature https://www.nature.com/articles/nature14236 3 comments
[1810.12894] Exploration by Random Network Distillation https://arxiv.org/abs/1810.12894 3 comments
GitHub - openai/universe: Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications. https://github.com/openai/universe 2 comments
GitHub - facebookresearch/CommAI-env: A platform for developing AI systems as described in A Roadmap towards Machine Intelligence - http://arxiv.org/abs/1511.08130 https://github.com/facebookresearch/CommAI-env 0 comments
GitHub - openai/gym: A toolkit for developing and comparing reinforcement learning algorithms. https://github.com/openai/gym 0 comments
[1606.01868] Unifying Count-Based Exploration and Intrinsic Motivation https://arxiv.org/abs/1606.01868 0 comments
GitHub - deepmind/lab: A customisable 3D platform for agent-based AI research https://github.com/deepmind/lab 0 comments
GitHub - openai/retro: Retro Games in Gym https://github.com/openai/retro 0 comments
