Hacker News
- Results of "Humanity's Last Exam" benchmark published https://scale.com/blog/humanitys-last-exam-results 140 comments
- AI now surpasses humans in almost all performance benchmarks https://newatlas.com/technology/ai-index-report-global-impact/ 2 comments
- AI now surpasses humans in almost all performance benchmarks https://newatlas.com/technology/ai-index-report-global-impact/ 15 comments
- Waymo outperforms comparable human benchmarks over 7M+ miles https://waymo.com/blog/2023/12/waymo-significantly-outperforms.html 861 comments
- Agent57: Outperforming the human Atari benchmark https://deepmind.com/blog/article/Agent57-Outperforming-the-human-Atari-benchmark 44 comments
- Google T5 scores 88.9 on SuperGLUE Benchmark, approaching human baseline https://super.gluebenchmark.com/leaderboard 235 comments
- MT-DNN Achieves Human Performance in General Language Understanding Benchmark https://blogs.msdn.microsoft.com/stevengu/2019/06/20/microsoft-achieves-human-performance-estimate-on-glue-benchmark/ 14 comments
- Air pollution greatest global threat to human health, says benchmark study https://phys.org/news/2023-08-air-pollution-greatest-global-threat.html 18 comments
- Benchmarking LLMs against human expert-curated biomedical knowledge graphs https://www.sciencedirect.com/science/article/pii/S2667318524000023 5 comments
- Not only is AI development accelerating, the rate it is surpassing human-level performance on each new benchmark measurement model is accelerating too. https://contextual.ai/plotting-progress-in-ai/ 30 comments futurology
- AI now surpasses humans in almost all performance benchmarks https://newatlas.com/technology/ai-index-report-global-impact/ 191 comments artificial
- AI now surpasses humans in almost all performance benchmarks https://newatlas.com/technology/ai-index-report-global-impact/ 444 comments futurology
- Human Benchmark - Quanto siete medi? https://www.reddit.com/r/italy/comments/hankii/human_benchmark_quanto_siete_medi/ 90 comments italy
- [R] Agent57: Outperforming the Atari Human Benchmark https://deepmind.com/blog/article/Agent57-Outperforming-the-human-Atari-benchmark 6 comments reinforcementlearning
- Detroit: Become Human Benchmark - Wine vs Windows https://www.youtube.com/watch?v=iH81xrJLTdM 62 comments linux_gaming
- Human and machine aspects of typing latency and text editor benchmarks https://pavelfatin.com/typing-with-pleasure 5 comments programming
- Once Human Handheld Performance Benchmark Review - Steam Deck and ROG Ally Tested https://www.techpowerup.com/review/once-human-steam-deck-fps-performance-benchmark/ 13 comments amd
- Waypoint - The official Waymo blog: Waymo significantly outperforms comparable human benchmarks over 7+ million miles of rider-only driving https://waymo-blog.blogspot.com/2023/12/waymo-significantly-outperforms.html?m=1 8 comments futurology
- Google AI, DeepMind And The University of Toronto Introduce DreamerV2, The First Reinforcement Learning (RL) Agent That Outperforms Humans on The Atari Benchmark https://arxiv.org/abs/2010.02193 3 comments artificial
- Tabula Sapiens is a benchmark, first-draft human cell atlas of nearly 500,000 cells from 24 organs of 15 normal human subjects. https://cellxgene.cziscience.com/collections/e5f58829-1a66-40b5-a624-9046778e74f5 3 comments science
- Scientists have calculated that the average person can recognize 5000 faces, giving a benchmark for comparing AI facial recognition to human performance https://www.smithsonianmag.com/smart-news/average-person-can-recognize-5000-faces-180970527/ 3 comments artificial
- Google DeepMind has trained a reinforcement learning agent called AlphaDev to find better sorting routines. It has discovered small sorting algorithms from scratch that outperform previously known human benchmarks and have now been integrated into the LLVM standard C++ sort library. https://www.deepmind.com/blog/alphadev-discovers-faster-sorting-algorithms 109 comments science
- LLMs saturate another hacking benchmark: "Frontier LLMs are better at cybersecurity than previously thought ... advanced LLMs could hack real-world systems at speeds far exceeding human capabilities." https://x.com/PalisadeAI/status/1866116594968973444 8 comments artificial
- Humans may have emerged as a distinct species as early as 350,000 years ago, according to a new study. DNA from the skeleton of a boy who lived in what's now South Africa may be the best benchmark so far for gauging when Homo sapiens originated in Africa. https://www.sciencenews.org/article/ancient-boys-dna-pushes-back-date-earliest-humans 126 comments science