Hacker News
- Sholto Douglas and Trenton Bricken – How to Build and Understand GPT-7's Mind https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken 3 comments
Linking pages
- What can LLMs never do? - by Rohit Krishnan https://www.strangeloopcanon.com/p/what-can-llms-never-do 376 comments
- Deepseek: The Quiet Giant Leading China’s AI Race https://www.chinatalk.media/p/deepseek-ceo-interview-with-chinas 4 comments
- Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution https://www.dwarkeshpatel.com/p/francois-chollet 2 comments
- Mark Zuckerberg - Llama 3, Open Sourcing $10b Models, & Caeser Augustus https://www.dwarkeshpatel.com/p/mark-zuckerberg 1 comment
- Dylan Patel & Jon (Asianometry) – How the Semiconductor Industry Actually Works https://www.dwarkeshpatel.com/p/dylan-jon 1 comment
- John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI https://www.dwarkeshpatel.com/p/john-schulman 0 comments
- Leopold Aschenbrenner - China/US Super Intelligence Race, 2027 AGI, & The Return of History https://www.dwarkeshpatel.com/p/leopold-aschenbrenner 0 comments
Linked pages
- Introducing Gemini 1.5, Google's next-generation AI model https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/ 715 comments
- Gemma: Google introduces new state-of-the-art open models https://blog.google/technology/developers/gemma-open-models/ 535 comments
- DALL·E 3 https://openai.com/dall-e-3 534 comments
- Large Language Model: world models or surface statistics? https://thegradient.pub/othello/ 458 comments
- https://www.wsj.com/tech/ai/sam-altman-seeks-trillions-of-dollars-to-reshape-business-of-chips-and-ai-89ab3db0 345 comments
- Technological singularity - Wikipedia https://en.wikipedia.org/wiki/Technological_singularity 304 comments
- Mixtral of experts | Mistral AI | Open source models https://mistral.ai/news/mixtral-of-experts/ 300 comments
- Will scaling work? - by Dwarkesh Patel - Dwarkesh Podcast https://www.dwarkeshpatel.com/p/will-scaling-work 286 comments
- AlphaGeometry: An Olympiad-level AI system for geometry - Google DeepMind https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/ 262 comments
- Why a Conversation With Bing’s Chatbot Left Me Deeply Unsettled - The New York Times https://www.nytimes.com/2023/02/16/technology/bing-chatbot-microsoft-chatgpt.html?referringSource=articleShare&smid=nytcore-ios-share 167 comments
- Stochastic parrot - Wikipedia https://en.wikipedia.org/wiki/Stochastic_parrot 161 comments
- The Friendship That Made Google Huge | The New Yorker https://www.newyorker.com/magazine/2018/12/10/the-friendship-that-made-google-huge 153 comments
- [2401.04088] Mixtral of Experts https://arxiv.org/abs/2401.04088 151 comments
- Bus factor - Wikipedia http://en.wikipedia.org/wiki/Bus_factor 127 comments
- [2212.03827] Discovering Latent Knowledge in Language Models Without Supervision https://arxiv.org/abs/2212.03827 86 comments
- What is a long context window? Google DeepMind engineers explain https://blog.google/technology/ai/long-context-window-ai-models/ 82 comments
- Pair programming - Wikipedia https://en.wikipedia.org/wiki/Pair_programming 67 comments
- The Scaling Hypothesis · Gwern.net https://www.gwern.net/Scaling-hypothesis 65 comments
- Microsoft Copilot: Ihr KI-Begleiter https://copilot.microsoft.com/ 54 comments
- How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog https://siboehm.com/articles/22/CUDA-MMM 49 comments
Related searches:
Search whole site: site:dwarkeshpatel.com
Search title: Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind
See how to search.