Linking pages
- Summarizing Books with Human Feedback https://openai.com/blog/summarizing-books/ 19 comments
- Measuring Goodhart’s Law https://openai.com/blog/measuring-goodharts-law/ 7 comments
- WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/webgpt/ 7 comments
- AI-Written Critiques Help Humans Notice Flaws https://openai.com/blog/critiques/ 6 comments
- WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/improving-factual-accuracy/ 5 comments
- Deepmind's new chatbot is "more helpful, correct, and harmless" https://the-decoder.com/deepminds-new-chatbot-is-more-helpful-correct-and-harmless/ 2 comments
- Scale: Rational in the Fullness of Time https://www.notboring.co/p/scale-rational-in-the-fullness-of 0 comments
- Human Feedback Improves OpenAI Model Summarizations | Synced https://syncedreview.com/2020/09/15/human-feedback-improves-openai-model-summarizations/ 0 comments
- Screening for Ethical API Access at Scale | Sahar Mor | The Startup https://medium.com/@saharhashai/screening-for-ethics-at-scale-611696a2a4e4 0 comments
- ChatGPT: The Latest and Greatest of Large Language Models from OpenAI [Examples and Resources] :: f3.al https://f3.al/chatgpt-definitive-resource/ 0 comments
Linked pages
- ChatGPT https://chat.openai.com/ 742 comments
- [2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
- OpenAI Status https://status.openai.com 195 comments
- Fine-Tuning GPT-2 from Human Preferences https://openai.com/blog/fine-tuning-gpt-2/ 19 comments
- [2009.01325] Learning to summarize from human feedback https://arxiv.org/abs/2009.01325 12 comments
- [1909.08593] Fine-Tuning Language Models from Human Preferences https://arxiv.org/abs/1909.08593 5 comments
- [1910.10683] Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer https://arxiv.org/abs/1910.10683 1 comment
- [1909.01326] The Woman Worked as a Babysitter: On Biases in Language Generation https://arxiv.org/abs/1909.01326 0 comments
- [1909.01214] Better Rewards Yield Better Summaries: Learning to Summarise Without References https://arxiv.org/abs/1909.01214 0 comments
Related searches:
Search whole site: site:openai.com
Search title: Learning to Summarize with Human Feedback
See how to search.