- [D] Do papers like this "disprove" the stochastic parrot theory? Pretty strong evidence that LLMs can build an internal world model, at least for simple board games. https://arxiv.org/abs/2210.13382 147 comments machinelearning
Linking pages
- Large Language Model: world models or surface statistics? https://thegradient.pub/othello/ 381 comments
- Chess-GPT’s Internal World Model | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html 112 comments
- Manipulating Chess-GPT’s World Model | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/03/20/chess-gpt-interventions.html 36 comments
- GitHub - JShollaj/awesome-llm-interpretability: A curated list of Large Language Model (LLM) Interpretability resources. https://github.com/JShollaj/awesome-llm-interpretability 1 comment
- Reinforcement learning is all you need, for next generation language models. https://yuxili.substack.com/p/reinforcement-learning-is-all-you 0 comments
- GitHub - elicit/machine-learning-list https://github.com/elicit/machine-learning-list 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2210.13382] Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
See how to search.