MathArena - discu.eu

Hacker News

Gemini 2.5 gets 24.4% on MathArena USAMO beating previous top score of 4.7% https://matharena.ai/ 10 comments 2/4/2025

Reddit

MathArena: Evaluating LLMs on Uncontaminated Math Competitions https://matharena.ai/ 7 comments 29/4/2025 math

Linking pages

New study shows why simulated reasoning AI models don’t yet live up to their billing - Ars Technica https://arstechnica.com/ai/2025/04/new-study-shows-why-simulated-reasoning-ai-models-dont-yet-live-up-to-their-billing/ 4 comments