Hacker News
Linking pages
- The Transformer Family Version 2.0 | Lil'Log https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/ 46 comments
- Generating music in the waveform domain – Sander Dieleman https://sander.ai/2020/03/24/audio-generation.html 41 comments
- An AI crushed two human pros at StarCraft—but it wasn’t a fair fight | Ars Technica https://arstechnica.com/gaming/2019/01/an-ai-crushed-two-human-pros-at-starcraft-but-it-wasnt-a-fair-fight/ 27 comments
- Distributed Inference and Fine-tuning of Large Language Models Over The Internet https://browse.arxiv.org/html/2312.08361v1 22 comments
- Google at NIPS 2017 – Google AI Blog https://research.googleblog.com/2017/12/google-at-nips-2017.html 17 comments
- GitHub - zziz/pwc: This repository is no longer maintained. https://github.com/zziz/pwc 13 comments
- Microsoft's AI generates realistic speech with only 200 training samples | VentureBeat https://venturebeat.com/2019/05/23/microsofts-ai-generates-realistic-speech-with-only-200-training-samples/ 6 comments
- [Masked] Language Modeling with Recurrent Neural Networks | by Deepak Mishra | Medium https://skilp4d.medium.com/masked-language-modeling-with-recurrent-neural-networks-cf28a7933f61 5 comments
- Attention? Attention! | Lil'Log https://lilianweng.github.io/posts/2018-06-24-attention/ 2 comments
- GitHub - daturkel/learning-papers: Landmark Papers in Machine Learning https://github.com/daturkel/learning-papers 1 comment
- Transformers for Image Recognition at Scale – Google AI Blog https://ai.googleblog.com/2020/12/transformers-for-image-recognition-at.html 1 comment
- Direct Fit to Nature: An Evolutionary Perspective on Biological and Artificial Neural Networks: Neuron https://www.cell.com/neuron/fulltext/S0896-6273(19)31044-X 0 comments
- Google Brain's AI achieves state-of-the-art text summarization performance | VentureBeat https://venturebeat.com/2019/12/23/google-brains-ai-achieves-state-of-the-art-text-summarization-performance/ 0 comments
- From Programs to Deep Models – Part 3: Code Completion | SIGPLAN Blog https://blog.sigplan.org/2020/05/11/from-programs-to-deep-models-part-3-code-completion/ 0 comments
- CS224U: Natural Language Understanding - Spring 2021 https://web.stanford.edu/class/cs224u/2021/ 0 comments
- An Overdue Post on AlphaStar, Part 2 http://www.alexirpan.com/2019/02/22/alphastar-part2.html 0 comments
- Google, Cambridge, DeepMind & Alan Turing Institute’s ‘Performer’ Transformer Slashes Compute Costs | Synced https://syncedreview.com/2020/10/02/google-cambridge-deepmind-alan-turing-institutes-performer-transformer-slashes-compute-costs/ 0 comments
- In-layer normalization techniques for training very deep neural networks | AI Summer https://theaisummer.com/normalization/ 0 comments
- Winning solution for Kaggle challenge: Lyft Motion Prediction for Autonomous Vehicles | by Artsiom Sanakoyeu | Medium https://gdude.medium.com/winning-solution-for-kaggle-challenge-lyft-motion-prediction-for-autonomous-vehicles-6bf67bf86f97 0 comments
- Generating music in the waveform domain – Sander Dieleman https://benanne.github.io/2020/03/24/audio-generation.html 0 comments
Related searches:
Search whole site: site:papers.nips.cc
Search title: Attention is All you Need
See how to search.