Tech
DeepSeek And Nvidia Logos
(VCG/Getty Images)

The trillion-dollar mystery surrounding DeepSeek’s Nvidia GPUs

There’s a cloud of suspicion hanging over the type and number of Nvidia GPUs DeepSeek used to train its R1 models.

At the center of the story of DeepSeek’s breakthrough achievement with its R1 models lies the Nvidia hardware that powered the servers that trained those models.

In December 2024, DeepSeek researchers released a paper that outlined the development and capabilities of the new DeepSeek-V3 large language model. In the paper, the researchers said they were able to train their powerful, efficient model over 2.78 million GPU hours of computing time on a cluster of only 2,048 Nvidia H800 GPUs. That is a very small number of GPUs for a model that matched or beat OpenAI’s state-of-the-art o1 model in some benchmarks.

For comparison, Meta trained its Llama 3.1 models on two clusters, using a total of 39.3 million GPU hours with 49,152 Nvidia H100 GPUs. Last week, Mark Zuckerberg said that Meta is planning on ending 2025 with over 1.3 million GPUs.

Released in 2023, the H800 is a GPU thats similar to the H100 but is tailored for the Chinese market to comply with US export controls concerning national security parameters that the Biden administration rolled out in 2022. Reuters reported that the main thing Nvidia changed in the H800 was that it “reduced the chip-to-chip data transfer rate to about half the rate.”

But The Wall Street Journal reports that government officials found the H800 exploited technical loopholes that met the strict requirements of the ban, but still gave Chinese buyers very powerful AI chips. To close the loophole, in October 2023, the US government banned the export of H800s as well.

It appears that DeepSeek was able to acquire its H800s during that short window of availability.

DeepSeek’s claims are drawing suspicion from some observers in the AI industry, but most appear to be just speculation. Scale AI CEO Alexandr Wang told CNBC that he suspected DeepSeek has “about 50,000 H100s, which they can’t talk about obviously because it is against the export controls that the United States has put in place,” and in a tweet, Elon Musk replied, “Obviously.” Musk, meanwhile, has bragged about xAI’s “Colossus supercluster,” which is powered by 100,000 H100 GPUs, and that he plans to scale up to 1 million of the expensive Nvidia chips.

There have been reports of H100s being smuggled into China through a series of intermediaries on the black market, but no evidence that DeepSeek did so.

Adding to the confusion, DeepSeek cofounder Liang Wenfeng said that the company does own a cluster of 10,000 Nvidia A100 GPUs, a cheaper and less powerful AI chip.

The H100 has earned a status of being one of the most coveted pieces of computer hardware in the AI age. Even when other chips are used, the power is sometimes expressed as a number of “H100-equivalent” GPUs.

Nvidia is in the process of rolling out its next-gen H200 Blackwell GPUs, and last year CEO Jensen Huang hand-delivered the first DGX H200 server to OpenAI headquarters.

More Tech

See all Tech
15

Tesla’s Robotaxi program has disclosed its 15th accident, Electrek reports, citing the latest filing from the National Highway Traffic Safety Administration. According to Electrek’s estimation, extrapolated from the last time Tesla disclosed mileage figures, that amounts to a crash every 57,000 miles — about 9 times the rate for humans.

The latest crash involved a Model Y hitting a fixed object at 9 mph in January while the autonomous system was engaged.

Humans are very much still involved with Tesla’s so-called autonomous driving service. Despite announcing in January that the service had started removing safety monitors from the front seats, only two unsupervised vehicles have been spotted in the last month according to Robotaxi Tracker. The entire fleet has also dwindled from around 50 vehicles to just 35. Their mileage is unavailable.

tech

Meta’s reported 20% layoff could bring headcount to its lowest level since 2021

Meta is rising Monday morning after Reuters reported the tech giant is planning to lay off 20% of its employees in an effort to use AI to make its workforce more efficient and offset its surging AI capex costs.

On the company’s last earnings call, CEO Mark Zuckerberg touted 30% efficiency gains for its software engineers and said some “power users” of the company’s AI coding tools saw productivity jump as high as 80% — what some saw as a veiled threat to employees who failed to use AI to boost their output.

Meta’s headcount was nearly 79,000 last quarter, having steadily risen since its layoffs during the self-described “year of efficiency” in 2023. A 20% cut would bring headcount to around 63,000 — the company’s lowest level since 2021.

Shares were recently up 2.7%.

Meta’s headcount was nearly 79,000 last quarter, having steadily risen since its layoffs during the self-described “year of efficiency” in 2023. A 20% cut would bring headcount to around 63,000 — the company’s lowest level since 2021.

Shares were recently up 2.7%.

tech

Report: Amid safety failures, ChatGPT’s planned “adult mode” caused concern within OpenAI, with minors misclassified as adults 12% of the time

Despite a series of alarming mental health safety failures that resulted in ChatGPT users allegedly using the product to plan suicides and murder, OpenAI decided to double down on its plan to roll out an “adult mode,” allowing the AI chatbot to produce erotic content.

That decision raised alarms within the company, warning that users could develop unhealthy emotional dependence on the chatbot and that the new age estimation feature was imperfect — and therefore likely to allow minors to access the feature — according to a new report from The Wall Street Journal. Per the report, some 12% of the time, the age estimation feature mistakenly classified minors as adults.

OpenAI’s council of mental health experts were “furious” and unanimous in their opposition to the plans to move forward with the adult mode feature after they were told about the decision in January, with concerns about creating a “sexy suicide coach.”

Earlier this month, the company said it would delay the new feature to focus on other products.

That decision raised alarms within the company, warning that users could develop unhealthy emotional dependence on the chatbot and that the new age estimation feature was imperfect — and therefore likely to allow minors to access the feature — according to a new report from The Wall Street Journal. Per the report, some 12% of the time, the age estimation feature mistakenly classified minors as adults.

OpenAI’s council of mental health experts were “furious” and unanimous in their opposition to the plans to move forward with the adult mode feature after they were told about the decision in January, with concerns about creating a “sexy suicide coach.”

Earlier this month, the company said it would delay the new feature to focus on other products.

tech
Rani Molla

Amazon raises the price for ad-free Prime Video to $4.99

Amazon is giving consumers more — for more. The e-commerce giant is raising the price of its ad-free Prime Video tier to $4.99 a month, up from $2.99.

On April 10, the service, now rebranded as Prime Video Ultra, will allow more concurrent streams (five instead of three) and up to 100 downloads, up from 25. Ad-free Prime Video had been included with a Prime membership until 2024, when Amazon added ads and began charging $2.99 a month to remove them.

For what it’s worth, ad-free Prime Video is still cheaper than the other increasingly expensive streaming services — if you don’t include the cost of Prime.

For what it’s worth, ad-free Prime Video is still cheaper than the other increasingly expensive streaming services — if you don’t include the cost of Prime.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Derivatives, LLC, or Robinhood Money, LLC. Futures and event contracts are offered through Robinhood Derivatives, LLC.