Hacker News
- "Fine-Tuning GPT-2 from Human Preferences" [training text generation using human ratings of quality] https://openai.com/blog/fine-tuning-gpt-2/ 5 comments reinforcementlearning
Linking pages
- It Looks Like You’re Trying To Take Over The World · Gwern.net https://www.gwern.net/Clippy 274 comments
- The Neural Net Tank Urban Legend · Gwern.net https://www.gwern.net/Tanks 61 comments
- Safety Gym https://openai.com/blog/safety-gym/ 17 comments
- Generating Fake News with OpenAI’s Language Models | by Adrian Yijie Xu | Towards Data Science https://towardsdatascience.com/creating-fake-news-with-openais-language-models-368e01a698a3#f7e4-509ea6e7d52d 16 comments
- WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/webgpt/ 7 comments
- WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/improving-factual-accuracy/ 5 comments
- How to Label 1M Data Points/Week - Scale https://scale.com/blog/how-to-label-1m-data-points-week 3 comments
- Learning to Summarize with Human Feedback https://openai.com/blog/learning-to-summarize-with-human-feedback/ 0 comments
- Scale: Rational in the Fullness of Time https://www.notboring.co/p/scale-rational-in-the-fullness-of 0 comments
- GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
- OpenAI's latest GPT-3 model generates better and longer texts https://the-decoder.com/openais-latest-gpt-3-model-generates-better-and-longer-texts/ 0 comments
- Truth https://compphil.github.io/truth/ 0 comments
Linked pages
- ChatGPT https://chat.openai.com/ 742 comments
- OpenAI Status https://status.openai.com 195 comments
- GPT-2: 6-Month Follow-Up https://openai.com/blog/gpt-2-6-month-follow-up/ 96 comments
- Accelerate the Development of AI Applications | Scale AI https://scale.ai/ 11 comments
- [1909.08593] Fine-Tuning Language Models from Human Preferences https://arxiv.org/abs/1909.08593 5 comments
- Andon (manufacturing) - Wikipedia https://en.wikipedia.org/wiki/Andon_(manufacturing) 0 comments