tech

Jon Keegan12/20/24

OpenAI announces new frontier models o3 and o3 mini

On the last day of “shipmas,” OpenAI saved what might be the biggest news for last, though the 1-800 number remains the most fun.

In a puzzling branding move, OpenAI CEO Sam Altman announced their latest frontier models: “o3” and “o3-mini.” For some reason (possibly trademark related), they’re skipping “o2” altogether.

The models are not available to the public yet, but researchers can apply to participate in “public safety testing” of the models, which are expected to be widely released at the end of January. The new models feature multistep “reasoning” like the current o1 model, but also apply the process to safety, leading to a higher success rate at catching prohibited responses, according to Altman.

Altman announced the models on a livestream and revealed that the new models had achieved the highest scores on a benchmark test that has been notoriously difficult for AI models to solve.

The ARC-AGI benchmark is a visual test that consists of a series of patterns of squares on a grid, and the model must apply unique solutions to each puzzle, which requires learning new skills with each problem.

Altman said that the o3 model performed 20% better than the current o1 model on coding benchmarks, and highlighted the performance and cost improvements for the smaller o3-mini model.

12 Days of OpenAI

12 Days of OpenAI

The models are not available to the public yet, but researchers can apply to participate in “public safety testing” of the models, which are expected to be widely released at the end of January. The new models feature multistep “reasoning” like the current o1 model, but also apply the process to safety, leading to a higher success rate at catching prohibited responses, according to Altman.

Altman announced the models on a livestream and revealed that the new models had achieved the highest scores on a benchmark test that has been notoriously difficult for AI models to solve.

The ARC-AGI benchmark is a visual test that consists of a series of patterns of squares on a grid, and the model must apply unique solutions to each puzzle, which requires learning new skills with each problem.

Altman said that the o3 model performed 20% better than the current o1 model on coding benchmarks, and highlighted the performance and cost improvements for the smaller o3-mini model.

More Tech

tech

Tesla’s Model Y just cleared a new federal safety bar

The National Highway Traffic Safety Administration announced today that Tesla Model Ys manufactured after November 12 were the first to pass the agency’s new advanced driver assistance system tests, which are now part of the New Car Assessment Program.

“By successfully passing these new tests, the 2026 Tesla Model Y demonstrates the lifesaving potential of driver assistance technologies and sets a high bar for the industry,” NHTSA Administrator Jonathan Morrison wrote in the press release. “We hope to see many more manufacturers develop vehicles that can meet these requirements.”

The new tests include:

Pedestrian automatic emergency braking
Lane-keeping assistance
Blind spot warning
Blind spot intervention

The milestone offers Tesla highly coveted regulatory validation, as it seeks to spur usage of its Full Self-Driving (Supervised) tech. The NHTSA didn’t immediately respond to a request for comment.

We knew Claude Code was driving crazy growth at Anthropic, but it may be much more than the company is expecting.

Speaking at the company’s developer conference yesterday, Anthropic CEO Dario Amodei said that while the company is planning for 10x growth this year, it could be as much as 80x, calling the overwhelming demand “crazy” and that he looked forward to more modest growth, saying such growth is “too hard to handle.”

The demand is so great that Anthropic partnered with Elon Musk’s xAI to buy up the bulk of computing from his Colossus data center in Tennessee.

tech

Tesla’s made-in-China vehicle sales jumped 36% in April

Tesla’s sales of made-in-China vehicles — sold across China, Europe, and other international markets — rose 36% year over year to 79,478 units in April. The increase marks the sixth straight month of annual growth in sales of vehicles made in the world’s largest manufacturing economy, suggesting the EV maker’s overseas business may be stabilizing after a difficult stretch.

That said, China wholesale deliveries fell from March, even as overall new energy vehicle sales rose 7% during the period.

Later this month, the China Passenger Car Association will report China-only sales, offering a clearer picture of performance in Tesla’s second-largest market.

Tesla China April wholesale sales jump year-on-year but slip from March

Tesla China April wholesale sales jump year-on-year but slip from March

Later this month, the China Passenger Car Association will report China-only sales, offering a clearer picture of performance in Tesla’s second-largest market.

tech

Rani Molla5/6/26

Anthropic’s scramble for compute now includes rival xAI

Another day, another major partnership with an AI rival. This time, Anthropic signed a deal with SpaceX’s xAI to access compute from its Colossus 1 data center to help it improve capacity for its Claude Pro and Claude Max subscribers. Just yesterday, The Information reported that Anthropic planned to spend $200 billion on Google Cloud services over the next five years. As Sherwood News’ Luke Kawa wrote:

“Anthropic has been a victim of its own success: the popularity of Claude Code and Cowork have revealed compute constraints and left users frustrated by caps. In response, the Claude developer has embarked upon a mad scramble for compute, striking or expanding deals with CoreWeave, Amazon, Google, and Broadcom.”

Now, it’s adding xAI to the list — even as the Elon Musk company builds a competing model.

In less terrestrial news, xAI said that as part of the agreement, Anthropic “expressed interest in partnering to develop multiple gigawatts of orbital AI compute capacity.”

xAI — Creators of Grok, the AI Chatbot

xAI — Creators of Grok, the AI Chatbot

“Anthropic has been a victim of its own success: the popularity of Claude Code and Cowork have revealed compute constraints and left users frustrated by caps. In response, the Claude developer has embarked upon a mad scramble for compute, striking or expanding deals with CoreWeave, Amazon, Google, and Broadcom.”

Now, it’s adding xAI to the list — even as the Elon Musk company builds a competing model.

In less terrestrial news, xAI said that as part of the agreement, Anthropic “expressed interest in partnering to develop multiple gigawatts of orbital AI compute capacity.”

TEST TIME!

Meet your new interview partner: Claude

Chris Stokel-Walker

Alien Robot Attack

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Derivatives, LLC, or Robinhood Money, LLC. Futures and event contracts are offered through Robinhood Derivatives, LLC.

©2026 Sherwood Media, LLC