Hacker News
Linking pages
Linked pages
- OpenAI o3 Breakthrough High Score on ARC-AGI-Pub https://arcprize.org/blog/oai-o3-pub-breakthrough 1772 comments
- https://openai.com/index/introducing-openai-o1-preview/ 17 comments
- Is the Tech Industry Nearing an A.I. Slowdown? - The New York Times https://www.nytimes.com/2024/12/19/technology/artificial-intelligence-data-openai-google.html 16 comments
- Did OpenAI Just Solve Abstract Reasoning? https://aiguide.substack.com/p/did-openai-just-solve-abstract-reasoning 12 comments
- https://x.com/nullpointered/status/1869871764714729980 11 comments
- Scheming reasoning evaluations — Apollo Research https://www.apolloresearch.ai/research/scheming-reasoning-evaluations 3 comments
- https://openai.com/12-days/?day=12 2 comments
- AI Alignment https://ai-alignment.com/ 1 comment
- Evaluating frontier AI R&D capabilities of language model agents against human experts - METR https://metr.org/blog/2024-11-22-evaluating-r-d-capabilities-of-llms/ 0 comments
- https://openai.com/index/early-access-for-safety-testing/#how-to-apply 0 comments
- FrontierMath | Epoch AI https://epoch.ai/frontiermath 0 comments
- LLMs struggle with perception, not reasoning, in ARC-AGI https://anokas.substack.com/p/llms-struggle-with-perception-not-reasoning-arcagi 0 comments
Related searches:
Search whole site: site:thezvi.substack.com
Search title: o3, Oh My - by Zvi Mowshowitz - Don't Worry About the Vase
See how to search.