Scene from the 1991 movie “Terminator 2: Judgment Day” (CBS/Getty Images)

Anthropic ponders self-improving AI

Anthropic says Claude already writes 80% of its code. A new post asks what happens when the models can improve themselves — and whether anyone could stop them.

Jon Keegan

6/5/26 8:24AM

As AI models rapidly improve at writing code, the role of humans in the process of software development is shifting to one of merely oversight and direction. Anthropic says that as of May 2026, Claude has written up about 80% of its internal code.

But what happens when the AI models don’t need humans any more, and the models can write the code to improve themselves autonomously?

This concept — known as recursive self-improvement — is currently getting a lot of attention in the AI industry. The risks of losing control of an AI system as it exponentially improves itself to the detriment (and attempted extermination) of humans is what happened with Skynet in “The Terminator.”

Anthropic ponders this concept in a long blog post authored by Marina Favaro and Jack Clark of the Anthropic Institute, which checks in on how far along the company’s models might be to something that looks like recursive self-improvement and how this could play out.

Models are rapidly improving

The authors wrote that across the industry, AI leaderboards are seeing consistent high scores from models “saturate” key coding benchmarks like SWE-bench. The models are now able to do bigger, more complex tasks — Claude Opus went from handling four-minute software tasks in 2024 to tackling 12-hour tasks in 2026.

Anthropic engineers are experiencing this dramatic shift in their work, according to a developer quoted in the post:

“I started leaning hard into Claudifying about a year ago. That’s been a crazy adventure and it’s now been ~5 months since I last wrote any code myself.”

The post includes a compelling chart showing a steady rise in lines of Claude-created internal code starting last year, followed by a steep jump with the arrival of Mythos. Not only was it written mostly by AI, but the quality of the code is expected to surpass human developers this year.

Anthropic chart - code contributed per person by quarter — The amount of code generated by AI within Anthropic has rapidly increased this year (Anthropic)

Humans have better “research taste”

The paper cites several key areas where Claude has excelled: it is very good at finding bugs in older code, it can be used to quickly diagnose and fix live system failures, and it can set up iterative code-rewriting loops that are currently able to speed up software around 52x on average (using Mythos).

In one example cited in the post, Claude made 800 fixes to an API, drastically reducing errors — work that would have taken a human engineer an estimated four years. This is the kind of work that would probably not even have been done in the first place, the authors added.

But humans appear to still have the edge in designing the crucial AI tests and experiments that help move AI forward. Humans have better “research taste,” though Claude is getting better at this, the paper notes.

Existential questions

Some of Anthropic’s developers seem to be grappling with existential issues related to their work. One employee was quoted as saying:

“On days where everything works well, I can’t help but think nothing I do matters, everything is automated and better and faster than I ever will be. But then there are days where everything breaks and I don’t understand why and I realize I have no idea what I’ve been up to anymore.”

Maybe it can’t happen

The authors frankly acknowledge that such self-improving systems might not even be possible. Human guidance has led to all of the breakthroughs to date, thanks to all those clever experiments we designed. Maybe AI is just a very useful tool for speeding up repetitive testing of the ideas we have — scale, fix, repeat.

So are we headed toward a world of self-improving AI models that we can’t keep an eye on? Anthropic is basically saying, we don’t really know. Super advanced AI systems could cure disease and power helpful robots, but it could also lead to other unforeseen negative consequences.

The authors lay out three possible scenarios for how they think this could play out:

1. Things could plateau: Supply chain constraints for data centers, chips, or electricity could preclude the next big leap in computing. Or maybe the crazy, consistent scaling we have seen just stops working.

2. Continued gains going forward: The most likely scenario described by the authors predicts that the work will essentially continue at pace, seeing “compounding efficiency gains.” But as code writing speeds up, human code review would still be a major bottleneck.

3. AI starts to build — and improve — itself: With humans largely out of the loop, the only constraint will be physical infrastructure and energy. Self-improving AI systems might decide to halt AI development, but they also could become “misaligned” with human safety:

“The rare occurrences of misalignment present in today’s models could compound as the models build their successors, growing more frequent but less understood until we lose control of them.”

Slow it down?

As to what the industry should do at this moment as it hurls into uncertainty, the paper offers some ways forward.

The authors considered the growing call to simply slow down AI development, to make sure the technology is used for good:

“If it were possible to effectively slow the development of this technology to give ourselves more time to deal with its immense implications, we think that would likely be a good thing.”

This would require a kind of global coordination that seems increasingly unlikely given today’s geopolitical problems. But even if we all could agree on what a pause might look like, bad actors could use that pause to level up their attacks, the authors argued.

A verification regime like a nuclear weapons treaty could serve as a model for international cooperation to regulate responsible development of self-improving systems, but AI moves much faster than the pace of decades-long diplomacy. As the authors wrote: “We don’t have that long.”

Chris Stokel-Walker

Bot bias

6/17/26

Companies are getting AI chatbots to smear their competitors

The race to influence AI chatbots is leading to some companies to adopt shady competitive tactics.

Tom Jones6/17/26

Prediction markets have, predictably, been given a boost by the summer of sports

Major platforms like Kalshi and Polymarket have seen huge upticks in users of late, thanks in no small part to what’s felt like a recent sporting smorgasbord, with major competitions across hockey, basketball, and soccer soaking up fans’ time (and spending, clearly) at the outset of summer.

While gaming industry groups may not like it, there’s been a huge change in the methods people are using to put money on the big games, with everyone from fortunate NYC bar owners, to a far less fortunate Spanish supporter, turning to prediction markets to try and turn their sports know-how into cold, hard cash.

According to a new report from Adam Blacker for apptopia, that shift might have been even more seismic than imagined in the wake of the NBA and NHL finals and around the 2026 World Cup kicking off.

2026 World Cup Reverses Seasonal Lull for Sports Betting Apps

According to a new report from Adam Blacker for apptopia, that shift might have been even more seismic than imagined in the wake of the NBA and NHL finals and around the 2026 World Cup kicking off.

South by Southwest Conference and Festivals

Gold Tesla Cybercabs are piling up, but they’re not picking up passengers yet

Low-volume production started in April. Now people are noticing them more and more in the wild.

Rani Molla6/15/26

Millie Giles

TAKING ACCOUNTS

6/15/26

Britain announces social media ban for under-16s starting early 2027

The UK government plans to use the same model for the restrictions as Australia — but how successful has that case study been so far?

UK Prime Minister Announces Under-16s Social Media Ban

Jon Keegan6/15/26

Anthropic pulls Fable and Mythos access worldwide after Trump administration bars their use by foreign nationals

Only days after releasing two versions of its next-gen AI model, Anthropic has disabled them for users worldwide.

Anthropic says it received a Friday night order from the Trump administration to suspend access to the models for any foreign national (anywhere in the world) — a group that included some Anthropic employees. In response, the company turned off access to everyone.

Last week, the company released to the public its much-anticipated Claude Fable 5 model (and its restricted version Claude Mythos 5, which is still being tested with trusted partners). Anthropic said in a blog post announcing the action that officials cited national security concerns with the new models, while offering few specific details.

The post said that the government gave the company “verbal evidence of a potential narrow, non-universal jailbreak” of the public Fable 5 model. A jailbreak is a means by which users can evade restrictions built into the code to unlock prohibited functionality. Anthropic downplayed the significance of the attack, and said other major models, such as OpenAI’s GPT-5.5, could also be affected by the technique described.

Fears of these first Mythos-class models being misused are running high, after Anthropic warned the cybersecurity world in May that the advanced cyber capabilities of Mythos have rapidly discovered thousands of vulnerabilities in ubiquitous software, leading to the decision to restrict the full version of the model to a close group of trusted partners for testing.

This morning, Axios reported that Anthropic technical staff have flown to Washington to meet with White House officials to resolve the issue.

The Wall Street Journal is reporting that the Trump administration’s decision to take action against Anthropic was prompted by discussions that Amazon CEO Andy Jassy had with officials, including Treasury Secretary Scott Bessent. According to the report, Amazon researchers said they had been able to evade some of Fable 5’s security restrictions using specific prompts. Amazon is a major investor in Anthropic.

Anthropic is currently suing the US government to fight the Pentagon’s blacklisting of the company on national security grounds.

Statement on the US government directive to suspend access to Fable 5 and Mythos 5