- It’s getting harder to measure just how good AI is getting https://www.vox.com/future-perfect/394336/artificial-intelligence-openai-o3-benchmarks-agi 10 comments futurology
Linked pages
- OpenAI o3 Breakthrough High Score on ARC-AGI-Pub https://arcprize.org/blog/oai-o3-pub-breakthrough 1772 comments
- AI achieves silver-medal standard solving International Mathematical Olympiad problems - Google DeepMind https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/ 904 comments
- https://www.axios.com/2025/01/07/openai-o3-college-students-computer-science 45 comments
- https://openai.com/12-days#day-8 42 comments
- What is artificial intelligence? Your AI questions, answered. - Vox https://www.vox.com/future-perfect/2018/12/21/18126576/ai-artificial-intelligence-machine-learning-safety-alignment 8 comments
- ARC Prize https://arcprize.org/ 3 comments
- MMLU Benchmark (Multi-task Language Understanding) | Papers With Code https://paperswithcode.com/sota/multi-task-language-understanding-on-mmlu 0 comments
- [2311.12022] GPQA: A Graduate-Level Google-Proof Q&A Benchmark https://arxiv.org/abs/2311.12022 0 comments
- Some warn that AI progress is slowing down, but the field isn’t done | Vox https://www.vox.com/future-perfect/389997/artificial-intelligence-openai-google-microsoft-chatgpt-progress-scaling 0 comments
- Why AI Progress Is Increasingly Invisible | TIME https://time.com/7205359/why-ai-progress-is-increasingly-invisible/ 0 comments
Related searches:
Search whole site: site:www.vox.com
Search title: It’s getting harder to measure just how good AI is getting | Vox
See how to search.