[2404.15758] Let's Think Dot by Dot: Hidden Computation in Transformer Language Models - discu.eu

Hacker News

Let's Think Dot by Dot: Hidden Computation in Transformer Language Models https://arxiv.org/abs/2404.15758 32 comments 27/4/2024

Reddit

"transformers can use meaningless filler tokens (e.g., '......') in place of a chain of thought" - Let's Think Dot by Dot [P] https://arxiv.org/abs/2404.15758 11 comments 28/4/2024 machinelearning

Linking pages

GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
From OpenAI to DeepSeek, companies say AI can “reason” now. Is it true? | Vox https://www.vox.com/future-perfect/400531/ai-reasoning-models-openai-deepseek 2 comments
Enhancing Transformer Models with Filler Tokens: A Novel AI Approach to Boosting Computational Capabilities in Complex Problem Solving - MarkTechPost https://www.marktechpost.com/2024/04/29/enhancing-transformer-models-with-filler-tokens-a-novel-ai-approach-to-boosting-computational-capabilities-in-complex-problem-solving/ 1 comment
The new law that could protect UK children online – as long as it works | Technology | The Guardian https://www.theguardian.com/global/article/2024/may/14/techscape-online-safety-act-children-internet 0 comments
GitHub - tmgthb/Autonomous-Agents: Autonomous Agents (LLMs) research papers. Updated Daily. https://github.com/tmgthb/Autonomous-Agents 0 comments
o1, inference time, Turing Machines and stuff https://zzbbyy.substack.com/p/o1-inference-time-turing-machines 0 comments

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:arxiv.org

Search title: [2404.15758] Let's Think Dot by Dot: Hidden Computation in Transformer Language Models

See how to search.

Submit link to: