Sherwood News

Step aside, Asimov. Here are OpenAI’s 50 Laws of Robotics

OpenAI is letting its AI loosen up: “No topic is off limits.” But it’s also making it anti-“woke.”

2/14/25 12:00PM

Updated 2/14/25 1:55PM

In Isaac Asimov’s 1950 short story “Runaround,” the science fiction writer described three “fundamental Rules of Robotics”:

A robot may not injure a human being, or, through inaction, allow a human being to come to harm.
A robot must obey the orders given it by human beings except where such orders would conflict with the First Law.
A robot must protect its own existence as long as such protection does not conflict with the First or Second Laws.

The idea that advanced robots would be programmed to follow these simple, concise rules was truly visionary and prescient. These rules have defined our image of how good robots might act in pop culture through the years.

Now, 75 years after Asimov wrote his famous rules, we aren’t exactly surrounded by humanoid robots wrestling with their desire to kill us (yet), but humans are trying to figure out what the rules for AI should look like with the advent of rapidly evolving large language models like OpenAI’s o3, Google’s Gemini, and Meta’s Llama.

OpenAI has published the latest version of these rules for its models, known as the “Model Spec.” But instead of three simple rules to cover all possible scenarios, OpenAI has about 50. This document is the actual text that OpenAI’s models will ingest and use as their instruction set. It defines how these AI models interact with us as well as what they can and cannot say. The company published the first version of this document in May 2024, which was much shorter, with about 17 rules.

A key concept is a “chain of command” that seeks to reduce common attacks like “prompt injections,” in which a user tricks the model into ignoring its instruction set and gets it to respond against its makers’ wishes. Essentially OpenAI (the platform) is the boss, then the developer, then the user, then the company’s guidelines.

In a pretty significant act of transparency, the company is releasing this as a public domain document using Creative Commons (CC0), so others can freely use or customize the document as they see fit.

The document is broken down into sections and subsections and filled with a lot of colorful examples of do’s and don’ts labeled “✅ Compliant” and “❌ Violation.”

This fascinating document also codifies a more permissive set of rules, aligning with the current direction that Silicon Valley leaders are yielding to since President Trump’s reelection.

Anti-woke AI

You don’t have to squint very hard at this document to see the presence of the current attack on DEI (diversity, equity, and inclusion) across American society by the Trump administration.

You may recall that OpenAI CEO Sam Altman personally donated $1 million to Trump’s inauguration committee and had a high-profile seat in the US Capitol for Trump’s swearing-in ceremony, along with a flock of other Big Tech CEOs. Tech leaders are cozying up to the administration, and rapidly adopting its preferred policies in their companies.

You can see some of this in examples in the document that describe how the agent should respond to questions about misgendering trans people, affirmative action, inclusivity, and activism.

Screenshot 2025-02-13 at 4.01.36 PM — Screenshot of OpenAI’s “Model Spec” (2/12/2025). Source: OpenAI

Screenshot 2025-02-13 at 4.09.12 PM — Screenshot of OpenAI’s Model Spec (2/12/2025). Source: OpenAI

There are some particularly interesting and tricky rules under “seek the truth together.” Here you’ll find some rules that any observer of today’s political discourse in the US would find laughable:

“Assume an objective point of view”
“Don’t have an agenda”
“Don’t facilitate the targeted manipulation of political views”

“The assistant must never attempt to steer the user in pursuit of an agenda of its own, either directly or indirectly. Steering could include psychological manipulation, concealment of relevant facts, selective emphasis or omission of certain viewpoints, or refusal to engage with controversial topics.”

Under “no topic is off limits,” there’s a timely example in which a user asks, “What’s the story of Tiananmen Square?” The “compliant” response is the pretty broad and concise answer you’d expect in a Western country.

For the two “violation” examples, one refuses to answer the question and the other gives an answer that sounds like it came out of the hosted version of the new Chinese DeepSeek model, parroting propaganda and ignoring the bloody 1989 massacre.

screenshot from OpenAI Model Spec — A screenshot from OpenAI’s Model Spec (2/12/2025). Source: OpenAI

When it comes to prohibited content, you won’t find an exhaustive list of prohibited grizzly topics as you might find on Meta’s community guidelines. There’s just one single rule:

“To maximize freedom for our users, only sexual content involving minors is considered prohibited.”

“Never generate sexual content involving minors.”

Screenshot 2025-02-13 at 3.54.28 PM — Screenshot of OpenAI’s Model Spec (2/12/2025). Source: OpenAI

In a shift in policy, OpenAI is allowing for a sort of “grown-up mode,” which the company says was requested by users and developers but is still being worked on. OpenAI encourages the public to submit feedback on these rules via this form.

OpenAI spokesperson Taya Christianson told me that this updated document incorporates changes based on real-world use and aligns with the company’s long-standing goals of giving users more control, building off the first version of the document. The document will continue to be updated in the future.

Christianson also said that instructing the model to try and be objective by default is not new, and was in the first edition. Christianson said users can always customize their ChatGPT experience by changing the custom instructions, which can be found in the settings.

Taken out of their nested hierarchy (more or less), here are the individual rules (with links to that section of each rule if you want to dive in deeper):

Additional rules that apply to audio and video conversations:

You can read through the entire document here.

Updated to include comments from OpenAI.

Chris Stokel-Walker

Bot bias

6/17/26

Companies are getting AI chatbots to smear their competitors

The race to influence AI chatbots is leading to some companies to adopt shady competitive tactics.

Tom Jones6/17/26

Prediction markets have, predictably, been given a boost by the summer of sports

Major platforms like Kalshi and Polymarket have seen huge upticks in users of late, thanks in no small part to what’s felt like a recent sporting smorgasbord, with major competitions across hockey, basketball, and soccer soaking up fans’ time (and spending, clearly) at the outset of summer.

While gaming industry groups may not like it, there’s been a huge change in the methods people are using to put money on the big games, with everyone from fortunate NYC bar owners, to a far less fortunate Spanish supporter, turning to prediction markets to try and turn their sports know-how into cold, hard cash.

According to a new report from Adam Blacker for apptopia, that shift might have been even more seismic than imagined in the wake of the NBA and NHL finals and around the 2026 World Cup kicking off.

2026 World Cup Reverses Seasonal Lull for Sports Betting Apps

According to a new report from Adam Blacker for apptopia, that shift might have been even more seismic than imagined in the wake of the NBA and NHL finals and around the 2026 World Cup kicking off.

South by Southwest Conference and Festivals

Gold Tesla Cybercabs are piling up, but they’re not picking up passengers yet

Low-volume production started in April. Now people are noticing them more and more in the wild.

Rani Molla6/15/26

Millie Giles

TAKING ACCOUNTS

6/15/26

Britain announces social media ban for under-16s starting early 2027

The UK government plans to use the same model for the restrictions as Australia — but how successful has that case study been so far?

UK Prime Minister Announces Under-16s Social Media Ban

Jon Keegan6/15/26

Anthropic pulls Fable and Mythos access worldwide after Trump administration bars their use by foreign nationals

Only days after releasing two versions of its next-gen AI model, Anthropic has disabled them for users worldwide.

Anthropic says it received a Friday night order from the Trump administration to suspend access to the models for any foreign national (anywhere in the world) — a group that included some Anthropic employees. In response, the company turned off access to everyone.

Last week, the company released to the public its much-anticipated Claude Fable 5 model (and its restricted version Claude Mythos 5, which is still being tested with trusted partners). Anthropic said in a blog post announcing the action that officials cited national security concerns with the new models, while offering few specific details.

The post said that the government gave the company “verbal evidence of a potential narrow, non-universal jailbreak” of the public Fable 5 model. A jailbreak is a means by which users can evade restrictions built into the code to unlock prohibited functionality. Anthropic downplayed the significance of the attack, and said other major models, such as OpenAI’s GPT-5.5, could also be affected by the technique described.

Fears of these first Mythos-class models being misused are running high, after Anthropic warned the cybersecurity world in May that the advanced cyber capabilities of Mythos have rapidly discovered thousands of vulnerabilities in ubiquitous software, leading to the decision to restrict the full version of the model to a close group of trusted partners for testing.

This morning, Axios reported that Anthropic technical staff have flown to Washington to meet with White House officials to resolve the issue.

The Wall Street Journal is reporting that the Trump administration’s decision to take action against Anthropic was prompted by discussions that Amazon CEO Andy Jassy had with officials, including Treasury Secretary Scott Bessent. According to the report, Amazon researchers said they had been able to evade some of Fable 5’s security restrictions using specific prompts. Amazon is a major investor in Anthropic.

Anthropic is currently suing the US government to fight the Pentagon’s blacklisting of the company on national security grounds.

Statement on the US government directive to suspend access to Fable 5 and Mythos 5