Reports of LLMs mastering math have been greatly exaggerated - discu.eu

Linking pages

New study shows why simulated reasoning AI models don’t yet live up to their billing - Ars Technica https://arstechnica.com/ai/2025/04/new-study-shows-why-simulated-reasoning-ai-models-dont-yet-live-up-to-their-billing/ 4 comments

Linked pages

Chatbots Are Cheating on Their Benchmark Tests - The Atlantic https://www.theatlantic.com/technology/archive/2025/03/chatbots-benchmark-tests/681929/ 6 comments
[2503.21934] Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad https://arxiv.org/abs/2503.21934 4 comments
AlphaGeometry2: Impressive accomplishment, but still a long path ahead https://garymarcus.substack.com/p/alphageometry2-impressive-accomplishment 0 comments

Related searches:

Search whole site: site:garymarcus.substack.com

Search title: Reports of LLMs mastering math have been greatly exaggerated

See how to search.

Submit link to: