reward hacking

By Brian Buntz | April 22, 2026

How a Berkeley team broke 8 major AI benchmarks. Six of them hit 100% without solving a single task

Every couple of months, a new AI model climbs a benchmark leaderboard. Companies cite the numbers in press releases, investors use them to justify valuations, and engineers use them to pick which model to deploy. A new project out of UC Berkeley helps confirm something many had suspected: leaderboards can be less reliable than they…

reward hacking

Search R&D World