- [R] Improving language models by retrieving from trillions of tokens https://arxiv.org/abs/2112.04426 9 comments machinelearning
Linking pages
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- The Illustrated Retrieval Transformer – Jay Alammar – Visualizing machine learning one concept at a time. http://jalammar.github.io/illustrated-retrieval-transformer/ 55 comments
- Language models in the biomedical and clinical tasks | by Albarqawi | Jan, 2023 | Medium https://albarqawi.medium.com/language-models-in-the-biomedical-and-clinical-tasks-b0fa4eefc210 1 comment
- Good News About the Carbon Footprint of Machine Learning Training – Google AI Blog https://ai.googleblog.com/2022/02/good-news-about-carbon-footprint-of.html 0 comments
- The State of Machine Learning in 8 Papers — February, 2022 | by Sergi Castella i Sapé | Heartbeat https://heartbeat.comet.ml/the-state-of-machine-learning-in-8-papers-february-2022-4cf0293f1b6?gi=3f78497257a1 0 comments
- DeepMind’s RETRO Retrieval-Enhanced Transformer Retrieves from Trillions of Tokens, Achieving Performance Comparable to GPT-3 With 25× Fewer Parameters | Synced https://syncedreview.com/2021/12/13/deepmind-podracer-tpu-based-rl-frameworks-deliver-exceptional-performance-at-low-cost-164/ 0 comments
- GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
- LLMs and the Emerging ML Tech Stack | by Brian | Feb, 2023 | Medium https://medium.com/@brian_90925/llms-and-the-emerging-ml-tech-stack-6fa66ee4561a 0 comments
- What to Watch in AI - The Generalist https://thegeneralist.substack.com/p/what-to-watch-in-ai-3 0 comments
- Customizing coding companions for organizations | AWS Machine Learning Blog https://aws.amazon.com/blogs/machine-learning/customizing-coding-companions-for-organizations/ 0 comments
- Retrieval Augmented Generation (RAG) for LLMs | Prompt Engineering Guide https://www.promptingguide.ai/research/rag 0 comments
- GitHub - microsoft/Megatron-DeepSpeed: Ongoing research training transformer language models at scale, including: BERT & GPT-2 https://github.com/microsoft/Megatron-DeepSpeed 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2112.04426] Improving language models by retrieving from trillions of tokens
See how to search.