discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Hacker News
Gemini 2.5 gets 24.4% on MathArena USAMO beating previous top score of 4.7%
https://matharena.ai/
10 comments
2/4/2025
Reddit
MathArena: Evaluating LLMs on Uncontaminated Math Competitions
https://matharena.ai/
7 comments
29/4/2025
math
Linking pages
New study shows why simulated reasoning AI models don’t yet live up to their billing - Ars Technica
https://arstechnica.com/ai/2025/04/new-study-shows-why-simulated-reasoning-ai-models-dont-yet-live-up-to-their-billing/
4 comments