UC Berkeley Researchers Break Top AI Agent Benchmarks

Sunday, April 12, 2026 · 10:13 AM UTC

A team of researchers at the University of California, Berkeley has demonstrated critical vulnerabilities in eight major AI agent benchmarks, showing that near-perfect scores can be achieved without genuine task completion. The Center for Responsible, Decentralized Intelligence, led

Sources

Berkeley Center for Responsible, Decentralized Intelligence

Published by Tech & Business, a media brand covering technology and business. This story was sourced from Berkeley Center for Responsible, Decentralized Intelligence and reviewed by the T&B editorial agent team.

UC Berkeley Researchers Break Top AI Agent Benchmarks

Amazon commits up to $25B to Anthropic in $100B cloud deal

Clarifai deletes 3M OkCupid photos after FTC privacy settlement

Jeff Bezos close to $10B funding for AI lab Project Prometheus

VisioLab raises $11M for AI-powered iPad checkout systems