'Humanity's Last Exam' Benchmark Published in Nature, Adopted by All Major Labs
The Center for AI Safety's ultra-hard benchmark, designed to test the outer limits of frontier model capabilities, is now a Nature paper and an industry standard.