Hacker News
- The Transformer Family https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/ 46 comments
Linking pages
Linked pages
- https://arxiv.org/abs/1706.03762 144 comments
- GitHub - facebookresearch/faiss: A library for efficient similarity search and clustering of dense vectors. https://github.com/facebookresearch/faiss 48 comments
- Locality-sensitive hashing - Wikipedia https://en.wikipedia.org/wiki/Locality-sensitive_hashing 40 comments
- Directed graph - Wikipedia http://en.wikipedia.org/wiki/Directed_graph 34 comments
- How to Train Really Large Models on Many GPUs? | Lil'Log https://lilianweng.github.io/posts/2021-09-25-train-large/ 33 comments
- [2203.08913] Memorizing Transformers https://arxiv.org/abs/2203.08913 32 comments
- Attention is All you Need https://papers.nips.cc/paper/7181-attention-is-all-you-need 30 comments
- google-research/scann at master · google-research/google-research · GitHub https://github.com/google-research/google-research/tree/master/scann 25 comments
- Large Transformer Model Inference Optimization | Lil'Log https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
- Rotation matrix - Wikipedia https://en.wikipedia.org/wiki/Rotation_matrix#Rotation_matrix_from_axis_and_angle 17 comments
- Contrastive Representation Learning | Lil'Log https://lilianweng.github.io/posts/2021-05-31-contrastive/ 10 comments
- [2106.01345] Decision Transformer: Reinforcement Learning via Sequence Modeling https://arxiv.org/abs/2106.01345 9 comments
- [2007.14062] Big Bird: Transformers for Longer Sequences https://arxiv.org/abs/2007.14062 0 comments
- [2001.04451] Reformer: The Efficient Transformer https://arxiv.org/abs/2001.04451 0 comments
- [2009.06732] Efficient Transformers: A Survey https://arxiv.org/abs/2009.06732 0 comments
- [2006.04768] Linformer: Self-Attention with Linear Complexity https://arxiv.org/abs/2006.04768 0 comments
- [2207.07061] Confident Adaptive Language Modeling https://arxiv.org/abs/2207.07061 0 comments
Related searches:
Search whole site: site:lilianweng.github.io
Search title: The Transformer Family Version 2.0 | Lil'Log
See how to search.