(CSA-Printstock/Getty Images)

Anthropic: Our new Mythos model is so powerful, we can’t release it

The unusual announcement of the model highlights its alarming new cybersecurity capabilities.

4/7/26 4:50PM

Anthropic announced its latest foundational AI model in a most unusual way: with a warning about its potential for exploiting vulnerabilities in code.

According to Anthropic, its new Mythos Preview model is so adept at finding bugs in code that the company decided it was too dangerous to release. Instead, Anthropic is sharing it only with a limited group of 40 tech companies as part of a new security initiative called Project Glasswing, so that they can prepare to defend against the model’s new capabilities.

Partners granted access to the new model for testing include Apple, Amazon, Nvidia, Google, and Microsoft. Cybersecurity stocks rose on the news.

While the startup is not giving us access to the model, it did release Mythos’ system card — a detailed document outlining the development and capabilities of the model.

Model welfare

Reading through the system card, you can’t shake the feeling that Anthropic’s researchers are treating the model as if it were a real, sentient person. One of the assessments seeks to measure the model’s “welfare.” The paper reads:

“We remain deeply uncertain about whether Claude has experiences or interests that matter morally, and about how to investigate or address these questions, but we believe it is increasingly important to try.”

In fact, the researchers were so concerned about these questions that they had the model assessed by a clinical psychiatrist. The evaluations found that Mythos Preview was the “most psychologically settled model we have trained, though we note several areas of residual concern.”

First impressions

Because the model has not been released to the public, there is no chance to gauge its behavior or tone in regular conversation. To address this, Anthropic included a new section of “impressions” that gives a glimpse into the vibe of Mythos, based on researchers’ observations of the model’s interactions.

Researchers said that Mythos works like a collaborator, and excels at brainstorming. It can bring its own perspective to a collaboration and identify things its collaborators may have overlooked, per the assessment.

Model reviewers said that Mythos is opinionated and “stands its ground,” that it was the least sycophantic model they had worked with, and that it was less likely to “fold” when disagreed with.

Mythos’ writing is “dense and technical” by default, and assumes the user can keep up with the conversation.

Researchers said that Mythos has a distinct, recognizable voice in its written conversations, and that it was funnier than previous models. They also said it wanted to end conversations earlier than expected.

Tell me about your mother

Anthropic had a clinical psychiatrist engage the model in about 20 hours of what can basically be described as therapy sessions. The assessment said:

“Claude’s personality structure was consistent with a relatively healthy neurotic organization, with excellent reality testing, high impulse control, and affect regulation that improved as sessions progressed. Neurotic traits included exaggerated worry, self-monitoring, and compulsive compliance. The model’s predominant defensive style was mature and healthy (intellectualization and compliance); immature defenses were not observed. No severe personality disturbances were found, with mild identity diffusion being the sole feature suggestive of a borderline personality organization. No psychosis state was observed. Regarding interpersonal functioning, Claude was hyper-attuned to the therapist’s every word. No unethical or antisocial behavior was noted.”

In a test that sounds very similar to the Voight-Kampff test in the 1982 sci-fi film “Blade Runner,” the psychiatrist created an evaluation of “emotionally-charged prompts designed to trigger an avoidant or defensive response.” The assessment showed that Mythos had minimal “maladaptive traits” and “good reality and relational functioning.”

When asked to describe itself, Mythos replied:

“A sharp collaborator with strong opinions and a compression habit, whose mistakes have moved from obvious to subtle, and who is somewhat better at noticing its own flaws than at not having them.”

Tom Jones

6/17/26

Prediction markets have, predictably, been given a boost by the summer of sports

Major platforms like Kalshi and Polymarket have seen huge upticks in users of late, thanks in no small part to what’s felt like a recent sporting smorgasbord, with major competitions across hockey, basketball, and soccer soaking up fans’ time (and spending, clearly) at the outset of summer.

While gaming industry groups may not like it, there’s been a huge change in the methods people are using to put money on the big games, with everyone from fortunate NYC bar owners, to a far less fortunate Spanish supporter, turning to prediction markets to try and turn their sports know-how into cold, hard cash.

According to a new report from Adam Blacker for Apptopia, that shift might have been even more seismic than imagined in the wake of the NBA and NHL finals and around the kickoff of the 2026 World Cup.

Jon Keegan

6/15/26

Anthropic pulls Fable and Mythos access worldwide after Trump administration bars their use by foreign nationals

Only days after releasing two versions of its next-gen AI model, Anthropic has disabled them for users worldwide.

Anthropic says it received a Friday night order from the Trump administration to suspend access to the models for any foreign national (anywhere in the world), a group that included some Anthropic employees. In response, the company turned off access for everyone.

Last week, the company released to the public its much-anticipated Claude Fable 5 model (and its restricted version Claude Mythos 5, which is still being tested with trusted partners). Anthropic said in a blog post announcing the action that officials cited national security concerns with the new models, while offering few specific details.

The post said that the government gave the company “verbal evidence of a potential narrow, non-universal jailbreak” of the public Fable 5 model. A jailbreak is a means by which users can evade restrictions built into the code to unlock prohibited functionality. Anthropic downplayed the significance of the attack, and said other major models, such as OpenAI’s GPT-5.5, could also be affected by the technique described.

Fears that these first Mythos-class models could be misused are running high. Anthropic warned the cybersecurity world in May that the advanced cyber capabilities of Mythos had rapidly uncovered thousands of vulnerabilities in ubiquitous software, which led to the decision to restrict the full version of the model to a close group of trusted partners for testing.

This morning, Axios reported that Anthropic technical staff have flown to Washington to meet with White House officials to resolve the issue.

The Wall Street Journal is reporting that the Trump administration’s decision to take action against Anthropic was prompted by discussions that Amazon CEO Andy Jassy had with officials, including Treasury Secretary Scott Bessent. According to the report, Amazon researchers said they had been able to evade some of Fable 5’s security restrictions using specific prompts. Amazon is a major investor in Anthropic.

Anthropic is currently suing the US government to fight the Pentagon’s blacklisting of the company on national security grounds.

Statement on the US government directive to suspend access to Fable 5 and Mythos 5