Multi-Model Ensemble Hits 72% on ARC-AGI-2 Benchmark
A parallel ensemble of GPT-5.2, Gemini 3, and Claude Opus 4.5 has scored 72% on ARC-AGI-2 — the highest reported result yet, and a data point for multi-model architectures as a path to reliable generalization.
Subscribe to unlock all stories
Get full access to The Singularity Ledger, archive included.
Cancel anytime. Payments powered by Stripe.