[2112.09332] WebGPT: Browser-assisted question-answering with human feedback - discu.eu

Hacker News

WebGPT: Browser-assisted question-answering with human feedback https://arxiv.org/abs/2112.09332 3 comments 23/1/2022

Linking pages

What We Know About LLMs (Primer) https://willthompson.name/what-we-know-about-llms-primer 164 comments
How RLHF actually works - by Nathan Lambert - Interconnects https://www.interconnects.ai/p/how-rlhf-works 32 comments
GitHub - WooooDyy/LLM-Agent-Paper-List: The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al. https://github.com/WooooDyy/LLM-Agent-Paper-List 28 comments
Measuring Goodhart’s Law https://openai.com/blog/measuring-goodharts-law/ 7 comments
WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/webgpt/ 7 comments
WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing https://openai.com/blog/improving-factual-accuracy/ 5 comments
Foundation Models: The future (still) isn't happening fast enough https://www.madrona.com/foundation-models/ 1 comment
Reward Modeling for Large language models (with code) https://explodinggradients.com/reward-modeling-for-large-language-models-with-code 1 comment
The State of Machine Learning in 8 Papers — February, 2022 | by Sergi Castella i Sapé | Heartbeat https://heartbeat.comet.ml/the-state-of-machine-learning-in-8-papers-february-2022-4cf0293f1b6?gi=3f78497257a1 0 comments
GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
ChatGPT: The Latest and Greatest of Large Language Models from OpenAI [Examples and Resources] :: f3.al https://f3.al/chatgpt-definitive-resource/ 0 comments
LLMs and a possible future for Search | by Grigory Sapunov | Dec, 2022 | Intento https://blog.inten.to/llms-and-a-possible-future-for-search-507f900ac9d2 0 comments
Google is Leading the AGI race. But can it win? https://sergey.substack.com/p/google-is-leading-the-agi-race-but 0 comments
The Next Generation Of Large Language Models https://www.forbes.com/sites/robtoews/2023/02/07/the-next-generation-of-large-language-models/ 0 comments
GitHub - opendilab/awesome-RLHF: A curated list of reinforcement learning with human feedback resources (continually updated) https://github.com/opendilab/awesome-RLHF 0 comments
ReAct: Synergizing Reasoning and Acting in Language Models – Google AI Blog https://ai.googleblog.com/2022/11/react-synergizing-reasoning-and-acting.html 0 comments
Uncertain Simulators Don't Always Simulate Uncertain Agents | Daniel D. Johnson https://www.danieldjohnson.com/2023/03/27/uncertain_simulators/ 0 comments
Transformer Taxonomy (the last lit review) | kipply's blog https://kipp.ly/blog/transformer-taxonomy/ 0 comments
GitHub - RUCAIBox/LLMSurvey: The official GitHub page for the survey paper "A Survey of Large Language Models". https://github.com/RUCAIBox/LLMSurvey 0 comments
RLHF learning resources in 2024 - by Nathan Lambert https://www.interconnects.ai/p/rlhf-resources 0 comments
