Hacker News
Linked pages
- OpenAI has hired an army of contractors to make basic coding obsolete | Semafor https://www.semafor.com/article/01/27/2023/openai-has-hired-an-army-of-contractors-to-make-basic-coding-obsolete 1384 comments
- Bing https://bing.com 750 comments
- ChatGPT https://chat.openai.com/ 742 comments
- Home \ Anthropic https://www.anthropic.com/ 48 comments
- Illustrating Reinforcement Learning from Human Feedback (RLHF) https://huggingface.co/blog/rlhf 14 comments
- [2009.01325] Learning to summarize from human feedback https://arxiv.org/abs/2009.01325 12 comments
- [2204.05862] Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback https://arxiv.org/abs/2204.05862 1 comment
- https://arxiv.org/abs/2203.02155 0 comments
Related searches:
Search whole site: site:maestroai.substack.com
Search title: Unpacking the HF in RLHF - by Justin Cranshaw
See how to search.