The LLM Reasoning Debate Heats Up - by Melanie Mitchell - discu.eu

Linking pages

What is Anthropic's AI Computer Use? - by Michael Spencer https://www.ai-supremacy.com/p/what-is-anthropics-ai-computer-use 0 comments

Linked pages

[2406.02061] Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models https://arxiv.org/abs/2406.02061 394 comments
[2410.05229] GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models https://arxiv.org/abs/2410.05229 267 comments
LLMs don’t do formal reasoning - and that is a HUGE problem https://garymarcus.substack.com/p/llms-dont-do-formal-reasoning-and 187 comments
[2407.01687] Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning https://arxiv.org/abs/2407.01687 19 comments
[2403.04121] Can Large Language Models Reason and Plan? https://arxiv.org/abs/2403.04121 3 comments
Can Large Language Models Reason? - by Melanie Mitchell https://aiguide.substack.com/p/can-large-language-models-reason 1 comment
[2402.19450] Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap https://arxiv.org/abs/2402.19450 1 comment
[2307.02477] Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks https://arxiv.org/abs/2307.02477 0 comments

Related searches:

Search whole site: site:aiguide.substack.com

Search title: The LLM Reasoning Debate Heats Up - by Melanie Mitchell

See how to search.

Submit link to: