[2112.11446] Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Linking pages

Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance – Google AI Blog https://ai.googleblog.com/2022/04/pathways-language-model-palm-scaling-to.html 279 comments
Minerva: Solving Quantitative Reasoning Problems with Language Models – Google AI Blog http://ai.googleblog.com/2022/06/minerva-solving-quantitative-reasoning.html 103 comments
Role play with large language models | Nature https://www.nature.com/articles/s41586-023-06647-8 89 comments
Why the Original Transformer Figure Is Wrong, and Some Other Interesting Historical Tidbits About LLMs https://magazine.sebastianraschka.com/p/why-the-original-transformer-figure 60 comments
Characterizing Emergent Phenomena in Large Language Models – Google AI Blog https://ai.googleblog.com/2022/11/characterizing-emergent-phenomena-in.html 57 comments
GitHub - lucidrains/x-transformers: A simple but complete full-attention transformer with a set of promising experimental features from various papers https://github.com/lucidrains/x-transformers 40 comments
Forecasting the future of artificial intelligence with machine learning-based link prediction in an exponentially growing knowledge network | Nature Machine Intelligence https://www.nature.com/articles/s42256-023-00735-0 24 comments
Distributed Inference and Fine-tuning of Large Language Models Over The Internet https://browse.arxiv.org/html/2312.08361v1 22 comments
GitHub - s-JoL/Open-Llama: The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. https://github.com/s-JoL/Open-Llama 13 comments
Google Research, 2022 & beyond: Language, vision and generative models – Google AI Blog https://ai.googleblog.com/2023/01/google-research-2022-beyond-language.html 5 comments
RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 1 comment
Google Trains 280 Billion Parameter AI Language Model Gopher https://www.infoq.com/news/2022/01/deepmind-gopher/ 0 comments
GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
Five years of progress in GPTs - by Finbarr Timbers https://finbarrtimbers.substack.com/p/five-years-of-progress-in-gpts 0 comments
Timeline of AI and language models – Dr Alan D. Thompson – Life Architect https://lifearchitect.ai/timeline/ 0 comments
GitHub - RUCAIBox/LLMSurvey: The official GitHub page for the survey paper "A Survey of Large Language Models". https://github.com/RUCAIBox/LLMSurvey 0 comments
GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments
[2209.10372] WeLM: A Well-Read Pre-trained Language Model for Chinese https://ar5iv.labs.arxiv.org/html/2209.10372 0 comments