Linking pages
- A decoder-only foundation model for time-series forecasting – Google Research Blog https://blog.research.google/2024/02/a-decoder-only-foundation-model-for.html 78 comments
- Reformer: The Efficient Transformer – Google AI Blog https://ai.googleblog.com/2020/01/reformer-efficient-transformer.html 44 comments
- MaMMUT: A simple vision-encoder text-decoder architecture for multimodal tasks – Google AI Blog https://ai.googleblog.com/2023/05/mammut-simple-vision-encoder-text.html 33 comments
- AI spots 40,000 prominent scientists overlooked by Wikipedia - The Verge https://www.theverge.com/2018/8/8/17663544/ai-scientists-wikipedia-primer 27 comments
- The Illustrated Transformer – Jay Alammar – Visualizing machine learning one concept at a time. https://jalammar.github.io/illustrated-transformer/ 25 comments
- The Illustrated Transformer – Jay Alammar – Visualizing machine learning one concept at a time. https://jalammar.github.io/illustrated-transformer/ 25 comments
- The Illustrated GPT-2 (Visualizing Transformer Language Models) – Jay Alammar – Visualizing machine learning one concept at a time. http://jalammar.github.io/illustrated-gpt2/ 8 comments
- The Annotated GPT-2 | Committed towards better future https://amaarora.github.io/2020/02/18/annotatedGPT2.html 2 comments
- GitHub - icoxfog417/awesome-text-summarization: The guide to tackle with the Text Summarization https://github.com/icoxfog417/awesome-text-summarization 1 comment
- What makes transformers unreasonably effective? https://crypticsilicon.substack.com/p/what-makes-transformers-unreasonably 1 comment
- GitHub - tensorflow/tensor2tensor: Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. https://github.com/tensorflow/tensor2tensor 0 comments
- Doctor GPT-3 - by Leon Lin - Avoid Boring People https://avoidboringpeople.substack.com/p/doctor-gpt-3 0 comments
- Deep Learning in Production: Sentiment Analysis with the Transformer | by Ivan Zhang | Medium https://medium.com/cortex-labs/deep-learning-in-production-sentiment-analysis-with-the-transformer-model-7fa053d0c85b 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1801.10198] Generating Wikipedia by Summarizing Long Sequences
See how to search.