Hacker News
- Transformers for software engineers https://blog.nelhage.com/post/transformers-for-software-engineers/ 20 comments
Linked pages
- [2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
- OpenAI Codex https://openai.com/blog/openai-codex/ 183 comments
- [1706.03762] Attention Is All You Need https://arxiv.org/abs/1706.03762 145 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- Home \ Anthropic https://www.anthropic.com/ 48 comments
- The Illustrated Transformer – Jay Alammar – Visualizing machine learning one concept at a time. https://jalammar.github.io/illustrated-transformer/ 36 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
- Google Colab https://colab.research.google.com/#scrollTo=Nma_JWh-W-IF 25 comments
- A Mathematical Framework for Transformer Circuits https://transformer-circuits.pub/2021/framework/index.html 9 comments
- Transformer Circuits Thread https://transformer-circuits.pub/ 8 comments
- Rotary Embeddings: A Relative Revolution | EleutherAI Blog https://blog.eleuther.ai/rotary-embeddings/ 1 comment
- TPU Research Cloud - About https://sites.research.google/trc/about/ 0 comments
- AI Safety Needs Great Engineers - LessWrong https://www.lesswrong.com/posts/YDF7XhMThhNfHfim9/ai-safety-needs-great-engineers 0 comments
Related searches:
Search whole site: site:blog.nelhage.com
Search title: Transformers for software engineers - Made of Bugs
See how to search.