- Study shows that the length of tasks AIs can do is doubling every 7 months. Extrapolating this trend predicts that in under five years we will see AI agents that can independently complete a large fraction of software tasks that currently take humans days https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/ 111 comments futurology
Linking pages