Multi-Model Ensemble Hits 72% on ARC-AGI-2 Benchmark

A parallel ensemble of GPT-5.2, Gemini 3, and Claude Opus 4.5 has scored 72% on ARC-AGI-2, the highest result reported so far and a data point in favor of multi-model architectures as a path to reliable generalization.
