Tech
Rani Molla

AI companies are sucking in YouTube subtitles for training


An investigation by Proof News found that major tech companies, including Anthropic, Nvidia, Apple, and Salesforce have been using subtitles from YouTube videos to train their AI models. The training dataset consisted of the subtitles from 173,536 videos from 48,000 channels included content from creators like MrBeast, PewDiePie, TED, and Khan Academy, among others. Those creators didn’t necessarily give permission or get paid. Earlier this year, the New York Times found that OpenAI, which has consistently avoided fessing up, also used YouTube data to train its AI.

173,536
YouTube videos used for AI training
By Anthropic, Nvidia, Apple, & Salesforce
That’s a surprise to the video makers
Like MrBeast, Khan Academy, PewDiePie
“It’s theft,” said one streaming service’s CEO

More Tech

See all Tech

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.