Inside AI Policy

April 29, 2025

Stanford’s 2025 AI Index highlights benchmark vacuum for AI responsibility, calls out METR

By Mariam Baksh / April 8, 2025

A high-profile organization’s assessment of AI models is insufficient for comparing responsibility across developers, Stanford University researchers said in their annual report on the status of the industry and policies governing it.

“Third-party evaluators like Gryphon, Apollo Research, and METR assess only select models, and their findings cannot be widely validated by the broader AI community,” reads the 2025 AI Index released April 7 by Stanford’s Institute for Human-Centered AI.

The annual AI index is a comprehensive and --...

