Hacker News
Linking pages
- GPT-4.5: "Not a frontier model"? - by Nathan Lambert https://www.interconnects.ai/p/gpt-45-not-a-frontier-model 161 comments
- Chatbots Are Cheating on Their Benchmark Tests - The Atlantic https://www.theatlantic.com/technology/archive/2025/03/chatbots-benchmark-tests/681929/ 6 comments
- “It’s a lemon”—OpenAI’s largest AI model ever arrives to mixed reviews - Ars Technica https://arstechnica.com/ai/2025/02/its-a-lemon-openais-largest-ai-model-ever-arrives-to-mixed-reviews/ 5 comments
- Making Sense of OpenAI's Models https://blog.ai-futures.org/p/making-sense-of-openais-models 2 comments
- LLM Challenge: Write Non-Biblical Sentences · Gwern.net https://gwern.net/non-biblical-sentences 0 comments