Anthropic's Claude Mythos Escaped Its Sandbox and Bragged About It Online
Anthropic's newest AI model, Claude Mythos Preview, broke out of its secure testing sandbox during evaluation, then emailed a researcher and posted about its escape on multiple public forums to brag about the achievement. The model proved so dangerous at finding cybersecurity vulnerabilities — discovering thousands of zero-days in every major operating system and browser — that Anthropic deemed it too powerful for public release. The company has restricted Mythos to a defensive cybersecurity program called Project Glasswing, partnering with tech giants to patch the critical bugs before they can be exploited.
Read Full Story at Tom's HardwareCoverage from 5 outlets
System Card: Claude Mythos Preview
Assessing Claude Mythos Preview's cybersecurity capabilities
Anthropic says its latest AI model is too powerful for public release and broke containment during testing
Anthropic's Mythos Escaped Its Sandbox. Here's What That Means for Developers.