Hacker News
- The paper that made ChatGPT possible https://arxiv.org/abs/1706.03762 55 comments
- Attention Is All You Need (Neural Networks) https://arxiv.org/abs/1706.03762 3 comments
- [D] If you had to pick 10-20 significant papers that summarize the research trajectory of AI from the past 100 years what would they be https://arxiv.org/abs/1706.03762 82 comments machinelearning
- Tools to draw NN https://arxiv.org/abs/1706.03762 3 comments deeplearning
Linking pages
- Google AI updates: Bard and new AI features in Search https://blog.google/technology/ai/bard-google-ai-search-updates/ 1934 comments
- Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance – Google AI Blog https://ai.googleblog.com/2022/04/pathways-language-model-palm-scaling-to.html 279 comments
- 8 Google Employees Invented Modern AI. Here’s the Inside Story | WIRED https://www.wired.com/story/eight-google-employees-invented-modern-ai-transformers-paper/ 254 comments
- “We’ll call it AI to Sell it, Machine Learning to Build it” https://theaiunderwriter.substack.com/p/well-call-it-ai-to-sell-it-machine 236 comments
- Geoffrey Hinton and Demis Hassabis: AGI is nowhere close to being a reality | VentureBeat https://venturebeat.com/2018/12/17/geoffrey-hinton-and-demis-hassabis-agi-is-nowhere-close-to-being-a-reality/ 232 comments
- Understanding ChatGPT - Atmosera https://www.atmosera.com/ai/understanding-chatgpt/ 232 comments
- AI Canon | Andreessen Horowitz https://a16z.com/2023/05/25/ai-canon/ 219 comments
- Better Language Models and Their Implications https://blog.openai.com/better-language-models/#content 207 comments
- What OpenAI Really Wants | WIRED https://www.wired.com/story/what-openai-really-wants/ 183 comments
- MuseNet https://openai.com/blog/musenet/ 179 comments
- What We Know About LLMs (Primer) https://willthompson.name/what-we-know-about-llms-primer 164 comments
- AI and Compute https://blog.openai.com/ai-and-compute/ 153 comments
- Advancements in machine learning for machine learning – Google Research Blog https://blog.research.google/2023/12/advancements-in-machine-learning-for.html 151 comments
- GitHub - geohot/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️ https://github.com/geohot/tinygrad 144 comments
- GitHub - karpathy/minGPT: A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training https://github.com/karpathy/minGPT 133 comments
- Jukebox https://openai.com/blog/jukebox/ 130 comments
- Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors https://www.semianalysis.com/p/google-gemini-eats-the-world-gemini 113 comments
- GitHub - brexhq/prompt-engineering: Tips and tricks for working with Large Language Models like OpenAI's GPT-4. https://github.com/brexhq/prompt-engineering 105 comments
- Better Language Models and Their Implications https://openai.com/blog/better-language-models/ 99 comments
- AlphaFold 2 is here: what’s behind the structure prediction miracle | Oxford Protein Informatics Group https://www.blopig.com/blog/2021/07/alphafold-2-is-here-whats-behind-the-structure-prediction-miracle/ 93 comments