Linked pages
- [2406.02061] Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models https://arxiv.org/abs/2406.02061 394 comments
- [2410.05229] GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models https://arxiv.org/abs/2410.05229 267 comments
- LLMs don’t do formal reasoning - and that is a HUGE problem https://garymarcus.substack.com/p/llms-dont-do-formal-reasoning-and 187 comments
- [2407.01687] Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning https://arxiv.org/abs/2407.01687 19 comments
- [2403.04121] Can Large Language Models Reason and Plan? https://arxiv.org/abs/2403.04121 3 comments
- Can Large Language Models Reason? - by Melanie Mitchell https://aiguide.substack.com/p/can-large-language-models-reason 1 comment
- [2402.19450] Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap https://arxiv.org/abs/2402.19450 1 comment
- [2307.02477] Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks https://arxiv.org/abs/2307.02477 0 comments
Related searches:
Search whole site: site:aiguide.substack.com
Search title: The LLM Reasoning Debate Heats Up - by Melanie Mitchell