'Humanity's Last Exam' Benchmark Published in Nature, Adopted by All Major Labs

The Center for AI Safety's ultra-hard benchmark, designed to test the outer limits of frontier model capabilities, is now a Nature paper and an industry standard.

Subscribe to unlock all stories

Get full access to The Singularity Ledger, archive included.

Cancel anytime. Payments powered by Stripe.