Hacker News
- Learning from Human Preferences https://blog.openai.com/deep-reinforcement-learning-from-human-preferences/ 7 comments
Linking pages
- Deep Reinforcement Learning Doesn't Work Yet https://www.alexirpan.com/2018/02/14/rl-hard.html 90 comments
- Lessons Learned Reproducing a Deep Reinforcement Learning Paper http://amid.fish/reproducing-deep-rl 37 comments
- AI Safety via Debate https://blog.openai.com/debate/ 17 comments
- Spinning Up in Deep RL: Workshop Review https://blog.openai.com/spinning-up-in-deep-rl-workshop-review/ 5 comments
- Generalizing from Simulation https://blog.openai.com/generalizing-from-simulation/ 0 comments
- AI Safety Needs Social Scientists https://blog.openai.com/ai-safety-needs-social-scientists/ 0 comments
- So there I was, firing a megawatt plasma collider at work... – Google AI Blog https://research.googleblog.com/2017/07/so-there-i-was-firing-megawatt-plasma.html 0 comments
Linked pages
- ChatGPT https://chat.openai.com/ 742 comments
- [1606.06565] Concrete Problems in AI Safety https://arxiv.org/abs/1606.06565 3 comments
- [1706.03741] Deep reinforcement learning from human preferences https://arxiv.org/abs/1706.03741 0 comments
- GitHub - openai/gym: A toolkit for developing and comparing reinforcement learning algorithms. https://github.com/openai/gym 0 comments
- Learning through human feedback https://deepmind.com/blog/learning-through-human-feedback/ 0 comments
Related searches:
Search whole site: site:blog.openai.com
Search title: Learning from Human Preferences
See how to search.