UK AISI: Claude Mythos Preview achieves 73% on expert cyber tasks — first model to complete a full network attack
The UK AI Safety Institute has published an evaluation of Anthropic's Claude Mythos Preview model showing significant advances in autonomous cyber capabilities. The model is the first to successfully complete a full 32-step simulated attack on a corporate network.