(VCG/Getty Images)

The trillion-dollar mystery surrounding DeepSeek’s Nvidia GPUs

There’s a cloud of suspicion hanging over the type and number of Nvidia GPUs DeepSeek used to train its R1 models.

1/29/25 11:23AM

At the center of the story of DeepSeek’s breakthrough achievement with its R1 models lies the Nvidia hardware that powered the servers that trained those models.

In December 2024, DeepSeek researchers released a paper that outlined the development and capabilities of the new DeepSeek-V3 large language model. In the paper, the researchers said they were able to train their powerful, efficient model over 2.78 million GPU hours of computing time on a cluster of only 2,048 Nvidia H800 GPUs. That is a very small number of GPUs for a model that matched or beat OpenAI’s state-of-the-art o1 model in some benchmarks.

For comparison, Meta trained its Llama 3.1 models on two clusters, using a total of 39.3 million GPU hours with 49,152 Nvidia H100 GPUs. Last week, Mark Zuckerberg said that Meta is planning on ending 2025 with over 1.3 million GPUs.
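The gap is easier to see as back-of-the-envelope arithmetic. The sketch below uses only the figures reported above and assumes, purely for illustration, that every GPU in each cluster ran in parallel for the entire training run. Neither company has said so, and an H800 hour is not directly equivalent to an H100 hour. The function name `wall_clock_days` is a hypothetical helper, not something from either paper.

```python
# Figures as reported in the article; everything derived below is an estimate.
DEEPSEEK_GPU_HOURS = 2.78e6   # DeepSeek-V3, Nvidia H800
DEEPSEEK_GPUS = 2048
LLAMA_GPU_HOURS = 39.3e6      # Llama 3.1, Nvidia H100
LLAMA_GPUS = 49152


def wall_clock_days(gpu_hours: float, gpu_count: int) -> float:
    """Days of training, ASSUMING all GPUs ran in parallel for the whole run."""
    return gpu_hours / gpu_count / 24


deepseek_days = wall_clock_days(DEEPSEEK_GPU_HOURS, DEEPSEEK_GPUS)
llama_days = wall_clock_days(LLAMA_GPU_HOURS, LLAMA_GPUS)

# How much more compute time and hardware Meta reported using.
gpu_hour_ratio = LLAMA_GPU_HOURS / DEEPSEEK_GPU_HOURS
gpu_count_ratio = LLAMA_GPUS / DEEPSEEK_GPUS

print(f"DeepSeek-V3: about {deepseek_days:.1f} days on {DEEPSEEK_GPUS:,} GPUs")
print(f"Llama 3.1:   about {llama_days:.1f} days on {LLAMA_GPUS:,} GPUs")
print(f"Meta reported {gpu_hour_ratio:.1f}x the GPU hours and {gpu_count_ratio:.0f}x the GPUs")
```

Under that assumption, DeepSeek's reported numbers imply a run of roughly eight weeks, while Meta's imply about 14 times the GPU hours spread across 24 times as many chips.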

Released in 2023, the H800 is a GPU that’s similar to the H100 but is tailored for the Chinese market to comply with the national security export controls that the Biden administration rolled out in 2022. Reuters reported that the main thing Nvidia changed in the H800 was that it “reduced the chip-to-chip data transfer rate to about half the rate.”

But The Wall Street Journal reports that government officials found the H800 exploited technical loopholes: it met the strict requirements of the ban but still gave Chinese buyers very powerful AI chips. To close the loophole, the US government banned the export of H800s as well in October 2023.

It appears that DeepSeek was able to acquire its H800s during that short window of availability.

DeepSeek’s claims are drawing suspicion from some observers in the AI industry, but most of it appears to be speculation. Scale AI CEO Alexandr Wang told CNBC that he suspected DeepSeek has “about 50,000 H100s, which they can’t talk about obviously because it is against the export controls that the United States has put in place,” and in a tweet, Elon Musk replied, “Obviously.” Musk, meanwhile, has bragged about xAI’s “Colossus supercluster,” which is powered by 100,000 H100 GPUs, and has said he plans to scale it up to 1 million of the expensive Nvidia chips.

There have been reports of H100s being smuggled into China through a series of intermediaries on the black market, but no evidence that DeepSeek did so.

Adding to the confusion, DeepSeek cofounder Liang Wenfeng said that the company does own a cluster of 10,000 Nvidia A100 GPUs, a cheaper and less powerful AI chip.

The H100 has become one of the most coveted pieces of computer hardware in the AI age. Even when other chips are used, computing power is sometimes expressed as a number of “H100-equivalent” GPUs.

Nvidia is in the process of rolling out its next-gen Blackwell GPUs. Last year, CEO Jensen Huang hand-delivered the first DGX H200 server, which is built on the earlier Hopper architecture, to OpenAI headquarters.

Jon Keegan, 5h

Anthropic projections for 2028: Up to $70 billion in revenue, could be profitable by 2027

Anthropic’s Claude API business is doing so well with enterprise customers that the company is raising its revenue forecasts significantly. According to a report from The Information, the company’s robust corporate sales have caused it to revise its most optimistic forecast up to $70 billion in sales by 2028.

Anthropic estimates its API sales will be double OpenAI’s. OpenAI is currently burning through much more money per month than Anthropic, and reportedly expects to spend as much as $115 billion through 2029, while Anthropic is forecasting that it could be cash-flow positive by 2027, per the report.

Anthropic Projects $70 Billion in Revenue, $17 Billion in Cash Flow in 2028

Rani Molla, 6h

Amazon, which is developing AI shopping agents, doesn’t want Perplexity’s AI shopping agents on its site

Amazon has sent a cease and desist letter to Perplexity AI, demanding that it stop letting its AI browser agent, Comet, make online purchases for users, Bloomberg reports.

Amazon, which is developing its own AI shopping agents and is having “conversations” with builders of third-party agents, accused the AI startup of “committing computer fraud by failing to disclose when its AI agent is shopping on a user’s behalf, in violation of Amazon’s terms of service.”

Perplexity, in response, said Amazon is attempting to “eliminate user rights” in order to sell more ads.

Amazon Demands Perplexity Stop AI Agent From Making Purchases

Jon Keegan, 6h

Apple to challenge Google Chromebooks with low-cost Mac laptop, Bloomberg reports

Apple is designing a new sub-$1,000 Mac laptop aimed at the education market, Bloomberg reports.

Google’s low-cost Chromebooks currently dominate the K-12 education market, and Apple’s reentry into the education market that it once owned could disrupt the sector’s status quo.

According to the report, Apple plans on using the custom mobile chips it currently uses in iPhones to power the more affordable devices.

Apple’s recent earnings demonstrated that iPhone sales have been steady, and the tech giant is looking to find new areas of growth, like services. A low-cost Mac could be popular with consumers, in addition to education buyers.

Apple Prepares to Enter Low-Cost Laptop Market for First Time

Jon Keegan, 7h

Getty Images suffers partial defeat in UK lawsuit against Stability AI

Stability AI, the creator of image generation tool Stable Diffusion, largely prevailed in a copyright lawsuit filed by Getty Images, which alleged the company illegally trained its AI models on Getty’s image library.

Lacking strong enough evidence, Getty dropped the part of the case alleging illegal training mid-trial, according to Reuters reporting.

Responding to the decision, Getty said in a press release:

“Today’s ruling confirms that Stable Diffusion’s inclusion of Getty Images’ trademarks in AI‑generated outputs infringed those trademarks. ... The ruling delivered another key finding; that, wherever the training and development did take place, Getty Images’ copyright‑protected works were used to train Stable Diffusion.”

Stability AI still faces a lawsuit from Getty in US courts, which remains ongoing.

A number of high-profile copyright cases are still working their way through the courts, as copyright holders seek to win strong protections for their works that were used to train AI models from a number of Big Tech companies.

Getty Images largely loses landmark UK lawsuit over AI image generator

Rani Molla

Waymo Business

10h

Uber says it’s doing better in markets where it has autonomous vehicles

Its autonomous ride-sharing business is still very small.