Opus 4.8 Leads SWE-Bench Pro by 20% Over GPT-5.5
New benchmark results show Anthropic's Opus 4.8 outperforming OpenAI's latest models on the SWE-Bench Pro coding benchmark by a significant margin.
Subscribe to unlock all stories
Get full access to The Singularity Ledger, archive included.
Cancel anytime. Payments powered by Stripe.