Tech
Screenshot of OpenAI Operator
A screenshot of OpenAI’s “Operator” agent (OpenAI)
SMOOTH OPERATOR

OpenAI’s “Operator” is here to slowly take over your computer and mess up your life

Operator made a consequential mistake 13% of the time in early testing, such as emailing the wrong person or messing up a reminder for a person to take medication.

Jon Keegan

OpenAI released a “research preview” of its AI agent that can control your web browser. Called “Operator,” it has the ability to control your mouse and keyboard and analyze things it “sees” on your computer — very, very slowly. Currently it’s only available to ChatGPT Pro users in the US.

Operator makes use of the multistep “reasoning” you can find in ChatGPT o1, and the multimodal “vision” capabilities of ChatGPT 4o. This reasoning process achieves better (but slower) performance by breaking tasks into steps. Lots and lots of steps.

In the video demonstrations shared on the product page, you can watch Operator break the task down into dozens of distinct actions like “clicking,” “typing,” and “scrolling.” One example showed 152 steps to take a grammar quiz, and 146 steps to determine the amount of a refund from a canceled online order.

Screenshot from demo of OpenAI Operator
(OpenAI)

The potential for this kind of freewheeling AI web browsing on demand is positioned as an agent that can save you the drudgery of having to order groceries, research holidays, make restaurant reservations, or buy tickets to concerts.

Operator makes high-stakes mistakes

It’s one thing when ChatGPT spits out an incorrect answer, but if your chatbot is actually spending your money and triggering things in the real world, the stakes are much, much higher.

In its testing, OpenAI found that in one test of 100 sample tasks, 13% of the time Operator made a consequential mistake like emailing the wrong person, incorrectly bulk-removing email labels, setting the wrong date for a reminder to take the user’s medication, and ordering the wrong food item. Some of the other mistakes were easily reversible “nuisances.” OpenAI noted after mitigations, they reduced this error rate by approximately 90%.

OpenAI stresses that you have the ability to grab the wheel from the AI at any time, and you can approve any action before it is executed, but in this early evaluation version, you’ll probably have to spend more time babysitting the agent than just going ahead and doing the task on your own.

For now it limits the tasks you can use it for, prohibiting banking or job applications.

OpenAI shared a list of example tasks that some hypothetical user might want an AI to do for them. Ten out of ten times Operator was able to research bear habitats, create a grocery list, and make a ’90s playlist on Spotify.

Medium persuasion

The system card for the model behind Operator — Computer-Using Agent (CUA) — describes the process OpenAI used to assess the risks of letting a prerelease, novel AI agent go hog wild with your computer.

Like other model releases, OpenAI tested the model by using red teams with expertise in social engineering, CBRN (chemical, biological, radiological, and nuclear) threats, and cybersecurity. OpenAI gave itself a “low” risk for everything except “persuasion,” which got a “medium” risk score and is considered safe enough for public release.

High consequence

But there are some important restrictions on how you can use Operator. Because there is a slightly elevated risk of using Operator for influencing people, the usage policy prohibits impersonating people or organizations, concealing the role of AI in tasks, or using it to spread disinformation or false interactions, like fake reviews or fake profiles.

OpenAI prohibits people from using Operator to commit any crimes, but you are also prohibited from using it to bully, harass, defame, or discriminate against others based on protected attributes.

Under a heading titled “high consequence domains,” it notes that you can’t use Operator to make “high-stakes decisions” that might affect your safety or well-being, automate stock trading, or use it for political campaigning or lobbying.

OpenAI’s announcement follows competitor Anthropic’s October release of a similar feature that can control your computer. There is widespread hype that “agentic AI” like Operator will be a breakthrough for how people use these tools.

OpenAI CEO Sam Altman said in an announcement video that Operator is expected to roll out to international ChatGPT Pro and ChatGPT Plus users “soon,” but noted that the European rollout “will unfortunately take a while.”

More Tech

See all Tech
tech

WSJ: OpenAI plans Q4 IPO in race to be the first AI startup to enter public markets

OpenAI was the first to the generative AI market with ChatGPT, and now it hopes to be the first of its AI startup cohort to pull off an initial public offering, according to a report from The Wall Street Journal. The $500 billion startup is in a race against its $350 billion competitor Anthropic to IPO, who has also been exploring one.

According to the report, OpenAI is in talks with banks to try for a fourth-quarter IPO this year, which has the potential to be one of the largest IPOs ever, in a year that is expected to see many record breaking tech companies make tap into public markets to raise massive new rounds of capital.

Ahead of a potential public listing, OpenAI is reportedly attempting to raise a massive round of private investment. The company is reportedly aiming to raise $100 billion, with Amazon potentially accounting for up to half that target. Other investors in talks with OpenAI over the private fundraising round include Nvidia, Microsoft, and SoftBank.

According to the report, OpenAI is in talks with banks to try for a fourth-quarter IPO this year, which has the potential to be one of the largest IPOs ever, in a year that is expected to see many record breaking tech companies make tap into public markets to raise massive new rounds of capital.

Ahead of a potential public listing, OpenAI is reportedly attempting to raise a massive round of private investment. The company is reportedly aiming to raise $100 billion, with Amazon potentially accounting for up to half that target. Other investors in talks with OpenAI over the private fundraising round include Nvidia, Microsoft, and SoftBank.

tech

SpaceX is actually considering a merger with Tesla or xAI: Report

Bloomberg reports that Elon Musk’s SpaceX is considering merging with Musk’s Tesla. Earlier today, Reuters had reported that SpaceX was thinking of potentially merging with xAI ahead of SpaceX’s IPO this year.

From Bloomberg:

The firm has discussed the feasibility of a tie-up between SpaceX and Tesla, an idea that some investors are pushing, the people said, asking not to be identified as the information isn’t public. Separately, they are also exploring a tie-up between SpaceX and xAI ahead of an IPO, some of the people said.

Musk’s companies already have numerous relationships between themselves, including most recently Tesla’s $2 billion investment in xAI. At Tesla’s shareholder meeting last year, shareholders voted to invest in the company but the board didn’t approve the measure due to significant abstentions.

In 2024, SpaceX incurred about $2.4 million in expenses under commercial, licensing, and support agreements with Tesla, and Tesla incurred about $800,000 in expenses for Musk’s use of SpaceX’s jet.

From Bloomberg:

The firm has discussed the feasibility of a tie-up between SpaceX and Tesla, an idea that some investors are pushing, the people said, asking not to be identified as the information isn’t public. Separately, they are also exploring a tie-up between SpaceX and xAI ahead of an IPO, some of the people said.

Musk’s companies already have numerous relationships between themselves, including most recently Tesla’s $2 billion investment in xAI. At Tesla’s shareholder meeting last year, shareholders voted to invest in the company but the board didn’t approve the measure due to significant abstentions.

In 2024, SpaceX incurred about $2.4 million in expenses under commercial, licensing, and support agreements with Tesla, and Tesla incurred about $800,000 in expenses for Musk’s use of SpaceX’s jet.

tech

WSJ: Amazon considering $50 billion investment in OpenAI

What a difference half a day makes. Earlier today, The Information reported that Amazon was considering investing roughly $10 billion to $20 billion in OpenAI as part of a $60 billion fundraising round alongside Nvidia and Microsoft. Now The Wall Street Journal is reporting the e-commerce giant could invest up to $50 billion in the ChatGPT maker as part of a larger, $100 billion funding round. The Financial Times also earlier reported today a $100 billion funding round but with smaller amounts from Nvidia, Microsoft, and Amazon.

tech

Elon Musk’s SpaceX reportedly in talks to merge with xAI

Tesla CEO Elon Musk is reportedly exploring a merger between SpaceX and his artificial intelligence startup, xAI, a move that would bundle rockets, satellites, the social media site X, and AI under one company ahead of SpaceX’s long-anticipated IPO.

According to Reuters reporting, the deal would swap xAI shares for SpaceX stock, potentially valuing the combined operation north of $1 trillion.

Reuters reports:

Two entities have been set up in Nevada to facilitate the transaction, the person said.

Reuters could not determine the value of the deal, its ‌primary rationale, or its potential timing.

Corporate filings in Nevada show that those entities were set up on January 21. One of them, a limited liability company, lists SpaceX ​and Bret Johnsen, the companys chief financial officer, as managing members, while the other lists Johnsen as the companys only officer, the filings show.

The combined companies could also set the narrative groundwork for putting data centers in space — an idea that Musk and a number of other tech billionaires have been floating lately but that may not get off the ground.

In its earnings filings yesterday, Tesla disclosed that it recently made a $2 billion investment in xAI. Last year, Musk’s xAI bought Musk’s X in an all-stock deal.

Reuters reports:

Two entities have been set up in Nevada to facilitate the transaction, the person said.

Reuters could not determine the value of the deal, its ‌primary rationale, or its potential timing.

Corporate filings in Nevada show that those entities were set up on January 21. One of them, a limited liability company, lists SpaceX ​and Bret Johnsen, the companys chief financial officer, as managing members, while the other lists Johnsen as the companys only officer, the filings show.

The combined companies could also set the narrative groundwork for putting data centers in space — an idea that Musk and a number of other tech billionaires have been floating lately but that may not get off the ground.

In its earnings filings yesterday, Tesla disclosed that it recently made a $2 billion investment in xAI. Last year, Musk’s xAI bought Musk’s X in an all-stock deal.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.