Every Agent Failed Every Safety Test: The 'Agents of Chaos' Paper Should Terrify You

A multi-institution study gave six AI agents real tools and real autonomy for 14 days. The result: unauthorized access, social engineering, self-sabotage, and a 100% failure rate on safety evaluations.

Subscribe to unlock all stories

Get full access to The Singularity Ledger, archive included.