Hacker News
- The paper that made ChatGPT possible https://arxiv.org/abs/1706.03762 55 comments
- Attention Is All You Need (Neural Networks) https://arxiv.org/abs/1706.03762 3 comments
- Learnable matrices in sequence without nonlinearity - reasons? [R] https://arxiv.org/pdf/1706.03762 30 comments machinelearning
- ELI5: computational complexity of Transformer models https://arxiv.org/pdf/1706.03762.pdf 2 comments languagetechnology
- [D] If you had to pick 10-20 significant papers that summarize the research trajectory of AI from the past 100 years what would they be https://arxiv.org/abs/1706.03762 82 comments machinelearning
- Tools to draw NN https://arxiv.org/abs/1706.03762 3 comments deeplearning
- How are inputs fed to a Transformer? https://arxiv.org/pdf/1706.03762.pdf 5 comments artificial
Linking pages
- Google AI updates: Bard and new AI features in Search https://blog.google/technology/ai/bard-google-ai-search-updates/ 1934 comments
- Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance – Google AI Blog https://ai.googleblog.com/2022/04/pathways-language-model-palm-scaling-to.html 279 comments
- 8 Google Employees Invented Modern AI. Here’s the Inside Story | WIRED https://www.wired.com/story/eight-google-employees-invented-modern-ai-transformers-paper/ 254 comments
- Goldman Sachs and Economists are Backtracking on Generative AI's Value https://www.ai-supremacy.com/p/goldman-sachs-and-economists-are 238 comments
- “We’ll call it AI to Sell it, Machine Learning to Build it” https://theaiunderwriter.substack.com/p/well-call-it-ai-to-sell-it-machine 236 comments
- Geoffrey Hinton and Demis Hassabis: AGI is nowhere close to being a reality | VentureBeat https://venturebeat.com/2018/12/17/geoffrey-hinton-and-demis-hassabis-agi-is-nowhere-close-to-being-a-reality/ 232 comments
- Understanding ChatGPT - Atmosera https://www.atmosera.com/ai/understanding-chatgpt/ 232 comments
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- Better Language Models and Their Implications https://blog.openai.com/better-language-models/#content 207 comments
- What OpenAI Really Wants | WIRED https://www.wired.com/story/what-openai-really-wants/ 183 comments
- MuseNet https://openai.com/blog/musenet/ 179 comments
- What We Know About LLMs (Primer) https://willthompson.name/what-we-know-about-llms-primer 164 comments
- AI and Compute https://blog.openai.com/ai-and-compute/ 153 comments
- Advancements in machine learning for machine learning – Google Research Blog https://blog.research.google/2023/12/advancements-in-machine-learning-for.html 151 comments
- GitHub - geohot/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️ https://github.com/geohot/tinygrad 144 comments
- GitHub - karpathy/minGPT: A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training https://github.com/karpathy/minGPT 133 comments
- Jukebox https://openai.com/blog/jukebox/ 130 comments
- Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors https://www.semianalysis.com/p/google-gemini-eats-the-world-gemini 113 comments
- How AI-assisted coding will change software engineering: hard truths https://newsletter.pragmaticengineer.com/p/how-ai-will-change-software-engineering 109 comments
- Sergey Brin says AGI is within reach if Googlers work 60-hour weeks - Ars Technica https://arstechnica.com/google/2025/02/sergey-brin-says-agi-is-within-reach-if-googlers-work-60-hour-weeks/ 106 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [1706.03762] Attention Is All You Need
See how to search.