- Transformers made so simple your grandma can code it now https://goyalpramod.github.io/blogs/Transformers_laid_out/ 43 comments learnmachinelearning
Linking pages
Linked pages
- xkcd: Precision vs Accuracy https://xkcd.com/ 404 comments
- [1706.03762] Attention Is All You Need https://arxiv.org/abs/1706.03762 145 comments
- The Illustrated Transformer – Jay Alammar – Visualizing machine learning one concept at a time. https://jalammar.github.io/illustrated-transformer/ 36 comments
- 3Blue1Brown https://www.3blue1brown.com/topics/neural-networks 17 comments
- PyTorch internals : ezyang’s blog http://blog.ezyang.com/2019/05/pytorch-internals/ 10 comments
- Linear — PyTorch 1.13 documentation https://pytorch.org/docs/stable/generated/torch.nn.Linear.html#torch.nn.Linear 8 comments
- The Annotated Transformer http://nlp.seas.harvard.edu/annotated-transformer/ 1 comment
- Batch and Layer Normalization | Pinecone https://www.pinecone.io/learn/batch-layer-normalization/ 0 comments
- What is torch.nn really? — PyTorch Tutorials 1.13.1+cu117 documentation https://pytorch.org/tutorials/beginner/nn_tutorial.html 0 comments
- Vanishing gradient problem - Wikipedia http://en.wikipedia.org/wiki/Vanishing_gradient_problem 0 comments
- Linear Relationships in the Transformerâs Positional Encoding - Timo Denk's Blog https://blog.timodenk.com/linear-relationships-in-the-transformers-positional-encoding/ 0 comments
Related searches:
Search whole site: site:goyalpramod.github.io
Search title: Transformers Laid Out | Pramod’s Blog
See how to search.