Hacker News
- Let's Think Dot by Dot: Hidden Computation in Transformer Language Models https://arxiv.org/abs/2404.15758 32 comments
- "transformers can use meaningless filler tokens (e.g., '......') in place of a chain of thought" - Let's Think Dot by Dot [P] https://arxiv.org/abs/2404.15758 11 comments machinelearning
Linking pages
- GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
- From OpenAI to DeepSeek, companies say AI can “reason” now. Is it true? | Vox https://www.vox.com/future-perfect/400531/ai-reasoning-models-openai-deepseek 2 comments
- Enhancing Transformer Models with Filler Tokens: A Novel AI Approach to Boosting Computational Capabilities in Complex Problem Solving - MarkTechPost https://www.marktechpost.com/2024/04/29/enhancing-transformer-models-with-filler-tokens-a-novel-ai-approach-to-boosting-computational-capabilities-in-complex-problem-solving/ 1 comment
- The new law that could protect UK children online – as long as it works | Technology | The Guardian https://www.theguardian.com/global/article/2024/may/14/techscape-online-safety-act-children-internet 0 comments
- GitHub - tmgthb/Autonomous-Agents: Autonomous Agents (LLMs) research papers. Updated Daily. https://github.com/tmgthb/Autonomous-Agents 0 comments
- o1, inference time, Turing Machines and stuff https://zzbbyy.substack.com/p/o1-inference-time-turing-machines 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2404.15758] Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
See how to search.