Linking pages
- Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
- Rethinking Attention with Performers – Google AI Blog https://ai.googleblog.com/2020/10/rethinking-attention-with-performers.html 52 comments
- The Transformer Family Version 2.0 | Lil'Log https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/ 46 comments
- Understanding and Coding the Self-Attention Mechanism of Large Language Models From Scratch https://sebastianraschka.com/blog/2023/self-attention-from-scratch.html 44 comments
- Understanding Large Language Models -- A Transformative Reading List https://sebastianraschka.com/blog/2023/llm-reading-list.html 26 comments
- Large Transformer Model Inference Optimization | Lil'Log https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
- NLP Research in the Era of LLMs - by Sebastian Ruder https://nlpnewsletter.substack.com/p/nlp-research-in-the-era-of-llms 17 comments
- GitHub - cmhungsteve/Awesome-Transformer-Attention: An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites https://github.com/cmhungsteve/Awesome-Transformer-Attention 13 comments
- Understanding and Coding Self-Attention, Multi-Head Attention, Cross-Attention, and Causal-Attention in LLMs https://magazine.sebastianraschka.com/p/understanding-and-coding-self-attention 11 comments
- Rethinking Attention with Performers – Google AI Blog https://ai.googleblog.com/2020/10/rethinking-attention-with-performers.html?m=1 4 comments
- The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka https://www.latent.space/p/yitay 4 comments
- GitHub - NiuTrans/ABigSurvey: A collection of 700+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML) https://github.com/NiuTrans/ABigSurvey 0 comments
- GitHub - tomohideshibata/BERT-related-papers: BERT-related papers https://github.com/tomohideshibata/BERT-related-papers 0 comments
- GitHub - eugeneyan/ml-surveys: 📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc. https://github.com/eugeneyan/ml-surveys 0 comments
- How is LLaMa.cpp possible? - by Finbarr Timbers https://finbarrtimbers.substack.com/p/how-is-llamacpp-possible 0 comments
- Greater sequence lengths will set us free | Pierce Freeman https://freeman.vc/notes/greater-sequence-lengths-will-set-us-free 0 comments
- Neural machine translation with a Transformer and Keras | Text | TensorFlow https://www.tensorflow.org/text/tutorials/transformer 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2009.06732] Efficient Transformers: A Survey
See how to search.