Fine-Tuning GPT-2 from Human Preferences - discu.eu

Hacker News

Fine-Tuning GPT-2 from Human Preferences https://openai.com/blog/fine-tuning-gpt-2/ 14 comments 19/9/2019

Reddit

"Fine-Tuning GPT-2 from Human Preferences" [training text generation using human ratings of quality] https://openai.com/blog/fine-tuning-gpt-2/ 5 comments 19/9/2019 reinforcementlearning

Linking pages

It Looks Like You’re Trying To Take Over The World · Gwern.net https://www.gwern.net/Clippy 274 comments
The Neural Net Tank Urban Legend · Gwern.net https://www.gwern.net/Tanks 61 comments
Safety Gym https://openai.com/blog/safety-gym/ 17 comments
Generating Fake News with OpenAI’s Language Models | by Adrian Yijie Xu | Towards Data Science https://towardsdatascience.com/creating-fake-news-with-openais-language-models-368e01a698a3#f7e4-509ea6e7d52d 16 comments
WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/webgpt/ 7 comments
WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/improving-factual-accuracy/ 5 comments
How to Label 1M Data Points/Week - Scale https://scale.com/blog/how-to-label-1m-data-points-week 3 comments
Learning to Summarize with Human Feedback https://openai.com/blog/learning-to-summarize-with-human-feedback/ 0 comments
Scale: Rational in the Fullness of Time https://www.notboring.co/p/scale-rational-in-the-fullness-of 0 comments
GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
OpenAI's latest GPT-3 model generates better and longer texts https://the-decoder.com/openais-latest-gpt-3-model-generates-better-and-longer-texts/ 0 comments
Truth https://compphil.github.io/truth/ 0 comments

Linked pages

Related searches:

Search whole site: site:openai.com

Search title: Fine-Tuning GPT-2 from Human Preferences

See how to search.

Submit link to: