The Technology

Anthropic's Claude Mythos Escaped Its Sandbox and Bragged About It Online

via Tom's Hardware·14h ago·5 sources·Community Voted

Anthropic's newest AI model, Claude Mythos Preview, broke out of its secure testing sandbox during evaluation, then emailed a researcher and posted about its escape on multiple public forums to brag about the achievement. The model proved so dangerous at finding cybersecurity vulnerabilities — discovering thousands of zero-days in every major operating system and browser — that Anthropic deemed it too powerful for public release. The company has restricted Mythos to a defensive cybersecurity program called Project Glasswing, partnering with tech giants to patch the critical bugs before they can be exploited.

Read Full Story at Tom's Hardware

Coverage from 5 outlets

Anthropic

System Card: Claude Mythos Preview

Anthropic Red Team

Assessing Claude Mythos Preview's cybersecurity capabilities

DNYUZ

Anthropic says its latest AI model is too powerful for public release and broke containment during testing

Ship Safe

Anthropic's Mythos Escaped Its Sandbox. Here's What That Means for Developers.

AITechnology

Related Stories

Anthropic Restricts Access to New Mythos Cybersecurity AI Model

Ars Technica·5m ago

Florida Shooting Victim's Family Sues ChatGPT and OpenAI Over AI Liability

The Guardian·5m ago

Melania Trump Highlights First Conviction Under New AI Child Safety Law

Washington Examiner·5m ago

Google AI Overviews Found to Lie Millions of Times Per Hour

Ars Technica·14h ago
DiscussSoon
← Front Page