Tech
Full frame Close up Background Blueberries
(Sunlight7/Getty Images)

GPT-5: “A legitimate PhD level expert in anything” that sucks at spelling and geography

OpenAI spent a lot of time talking about how “smart” GPT-5 is, yet it failed spectacularly at tasks that a second grader could achieve.

Jon Keegan

Yesterday, OpenAI spent a lot of time talking about how “smart” its new GPT-5 model was.

OpenAI cofounder and CEO Sam Altman said of GPT-5: “Its like talking to an expert, a legitimate PhD level expert in anything,” and called it “the most powerful, the most smart, the fastest, the most reliable and the most robust reasoning model that weve shipped to date.”

Demos showed GPT-5 effortlessly creating an interactive simulation to explain the Bernoulli effect, diagnosing and fixing complex code errors, planning personalized running schedules, and “vibecoding” a cartoony 3D castle game. The company touted benchmarks showing how GPT-5 aced questions from a Harvard-MIT mathematics tournament and got high scores on coding, visual problem-solving, and other impressive tasks.

But once the public got its chance to kick GPT-5’s tires, some cracks started to emerge in this image of a superintelligent expert in everything.

AI models are famously bad at spelling different kinds of berries, and GPT-5 is no exception.

I had to try the “blueberry” thing myself with GPT5. I merely report the results.

[image or embed]

— Kieran Healy (@kjhealy.co) August 7, 2025 at 8:04 PM

Another classic failure of being bad at maps persisted with GPT-5 when it was asked to list all the states with the letter R in their names. It even offered to map them, with hilarious results:

My goto is to ask LLMs how many states have R in their name. They always fail. GPT 5 included Indiana, Illinois, and Texas in its list. It then asked me if I wanted an alphabetical highlighted map. Sure, why not.

[image or embed]

— radams (@radamssmash.bsky.social) August 7, 2025 at 8:40 PM

To be fair to OpenAI, this problem isn’t unique to GPT-5, as similar failures were documented with Google’s Gemini.

During the livestream, in an embarrassing screwup, a presentation showed some charts that looked like they had some of the same problems.

If this was all described as a “technical preview,” these kinds of mistakes might be understandable. But this is a real product from a company thats pulling in $1 billion per month. OpenAI’s models are being sold to schools, hospitals, law firms, and government agencies, including for national security and defense.

OpenAI is also telling users that GPT-5 can be used for medical information, while cautioning that the model does not replace a medical professional.

“GPT‑5 is our best model yet for health-related questions, empowering users to be informed about and advocate for their health.”

Why is this so hard?

The reason why such an advanced model can appear to be so capable at complex coding, math, and physics yet fail so spectacularly at spelling and maps is that generative models like GPT-5 are probabilistic systems at their core — they predict the most likely next token based on the volumes of data they have been trained on. They don’t know anything, and they don’t think or have any model of how the world works (though researchers are working on that).

When the model writes its response, you see the thing that has the highest score for what you should see next. But with math and coding, the rules are more strict and the examples its been trained on are consistent, so it has higher accuracy and can ace the math benchmarks with flying colors.

But drawing a map with names or counting the letters in a word are weirdly tough, as it requires a skill the model doesn’t really have and has to figure out step by step from patterns, which can lead to odd results. That’s a simplification of a very complex and sophisticated system, but applies to a lot of the generative-AI technology in use today.

Thats also a whole lot to explain to users, but OpenAI boils those complicated ideas down to a single warning below the text box: “ChatGPT can make mistakes. Check important info.”

OpenAI did not immediately respond to a request for comment.

More Tech

See all Tech
tech
Rani Molla

Amazon raises the price for ad-free Prime Video to $4.99

Amazon is giving consumers more — for more. The e-commerce giant is raising the price of its ad-free Prime Video tier to $4.99 a month, up from $2.99.

On April 10, the service, now rebranded as Prime Video Ultra, will allow more concurrent streams (five instead of three) and up to 100 downloads, up from 25. Ad-free Prime Video had been included with a Prime membership until 2024, when Amazon added ads and began charging $2.99 a month to remove them.

For what it’s worth, ad-free Prime Video is still cheaper than the other increasingly expensive streaming services — if you don’t include the cost of Prime.

For what it’s worth, ad-free Prime Video is still cheaper than the other increasingly expensive streaming services — if you don’t include the cost of Prime.

tech
Rani Molla

Uber relaunches robotaxi service with Hyundai-backed Motional in Las Vegas

What happens in Vegas, keeps happening in Vegas.

Uber users in Las Vegas can now be matched with an electric Motional IONIQ 5 robotaxi along parts of the Strip and at select casinos, resorts, and the Town Square shopping district near the airport, the companies said. For now, each vehicle includes a human safety operator monitoring from behind the wheel, who the companies say will be removed by year’s end.

Uber and Hyundai-backed autonomous tech company Motional previously tested a service there in 2022. “Motional is ready to put our extensive ride hail experience to work with Uber again,” said David Carroll, vice president of commercialization at Motional, which paused its commercial deployments in 2024 to refocus on its core driverless technology after scaling back operations.

This time around, the companies will be joining a much more crowded field. Amazon-owned Zoox has been offering free rides along select destinations on the Strip since last year, and both Tesla’s Robotaxi and Alphabet-owned Waymo have plans to open up shop there in the near future.

Thanks to a spate of recent AV partnerships, Uber, which sold its own autonomous unit back in 2020, is finding itself at the center of the nascent robotaxi boom.

tech
Rani Molla

Musk says “xAI was not built right” amid executive departures, Cursor hires

There’s been a lot of turnover lately at xAI, with numerous executive departures and, yesterday, news that the SpaceX-owned company was hiring two senior leaders from Cursor, an AI coding startup that’s raising funds at a $50 billion valuation.

The reason? “xAI was not built right first time around, so is being rebuilt from the foundations up,” CEO Elon Musk posted on xAI-owned X yesterday, in response to a post about the Cursor hires. Earlier this month, Musk told a conference audience, “Grok is currently behind on coding.”

The news amounts to an admission of a reset inside xAI and an acknowledgment that the company is trailing AI peers like Anthropic and OpenAI in one of AI’s most commercially important applications: coding.

tech
Jon Keegan

War in the Middle East halts Meta’s undersea fiber project

Meta’s massive undersea cable project connecting Africa and the Middle East to Europe has run into an unexpected obstacle — not under the sea, but in the sky and land above: the war in the Middle East.

According to a report from Bloomberg, France’s Alcatel Submarine Networks, the company that is laying the cable, notified customers that it can no longer safely operate in the area.

The 2Africa project consists of a 45,000-kilometer chain of undersea fiber-optic cables that encircles Africa and runs through the Red Sea, up through the Gulf of Oman, where the Strait of Hormuz sits. Iran has declared the strait — a crucial choke point for oil and natural gas tankers — closed for traffic.

Meta is building the network in partnership with Bayobab, China Mobile, Orange, Telecom Egypt, Vodafone, WIOCC, and Center3.

The 2Africa project consists of a 45,000-kilometer chain of undersea fiber-optic cables that encircles Africa and runs through the Red Sea, up through the Gulf of Oman, where the Strait of Hormuz sits. Iran has declared the strait — a crucial choke point for oil and natural gas tankers — closed for traffic.

Meta is building the network in partnership with Bayobab, China Mobile, Orange, Telecom Egypt, Vodafone, WIOCC, and Center3.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Derivatives, LLC, or Robinhood Money, LLC. Futures and event contracts are offered through Robinhood Derivatives, LLC.