Hacker News
- Do Large Language Models learn world models or just surface statistics? (2023) https://thegradient.pub/othello/ 77 comments
- Do Large Language Models learn world models or just surface statistics? https://thegradient.pub/othello/ 174 comments
- Research shows Large Language Models such as ChatGPT do develop internal world models and not just statistical correlations https://thegradient.pub/othello/ 202 comments futurology
- Large Language Model: world models or surface statistics? [R] https://thegradient.pub/othello/ 4 comments machinelearning
Linking pages
- Chess-GPT’s Internal World Model | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html 112 comments
- What is AI interpretability? Artificial intelligence researchers are reverse-engineering ChatGPT, Claude, and Gemini. - Vox https://www.vox.com/future-perfect/362759/ai-interpretability-openai-claude-gemini-neuroscience 7 comments
- Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken 3 comments
- ChatGPT, LLMs and Foundation models — a closer look into the hype and implications for startups | by Oliver Molander | Feb, 2023 | Medium https://olivermolander.medium.com/chatgpt-llms-and-foundation-models-a-closer-look-into-the-hype-and-implications-for-startups-b2f1d82f4d46 2 comments
- Language models rely on meaningful abstractions https://dpaleka.substack.com/p/language-models-rely-on-meaningful 1 comment
- Playing Chess - LLMs and Actual Chess AIs - by Zap https://ageofai.substack.com/p/playing-chess-llms-and-actual-chess 0 comments
- Truth https://compphil.github.io/truth/ 0 comments
- Chaotic Thoughts About Order – Gareth Stack – Blog https://garethstack.wordpress.com/2024/02/18/chaotic-thoughts-about-order/ 0 comments
- Are Video Generation Models World Simulators? · Artificial Cognition https://artificialcognition.net/posts/video-generation-world-simulators/ 0 comments
Linked pages
- A.I. Is Mastering Language. Should We Trust What It Says? - The New York Times https://www.nytimes.com/2022/04/15/magazine/ai-language.html 162 comments
- [2210.13382] Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task https://arxiv.org/abs/2210.13382 148 comments
- Andrej Karpathy on Twitter: "Nice read on reverse engineering of GitHub Copilot 🪄. Copilot has dramatically accelerated my coding, it's hard to imagine going back to "manual coding". Still learning to use it but it already writes ~80% of my code, ~80% accuracy. I don't even really code, I prompt. &ampampampampampampampampampampampampampampampampampampampampampampampampampampampampampampampampampampampamp; edit." / Twitter https://twitter.com/karpathy/status/1608895189078380544 3 comments
- Fischer random chess - Wikipedia https://en.wikipedia.org/wiki/Fischer_random_chess 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:thegradient.pub
Search title: Large Language Model: world models or surface statistics?
See how to search.