Look who’s talking

Shazam for babies

Do baby cry translators work?

Apps aim to help overwhelmed new parents decode the meaning of their baby’s wails.

What if babies could talk?

It’s one of the great unanswered questions of our time — a muse to some of humankind’s foremost thinkers (the creators of “The Boss Baby,” etc.) — and we may be closer to an answer than ever before.

AI baby translation, specifically of babies’ cries (one of the main sounds babies make), is on the rise. An app store search for “baby translator” nets dozens of products aimed at helping frazzled new parents decode the meaning behind their baby’s wails.

The tech works like Shazam, the app that identifies music. You record your baby crying, the AI cross-references its dataset of labeled cries, and voila — you’re presented with a translation.
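None of the apps' actual algorithms are described here, so the following is only a rough sketch of what "cross-referencing a dataset of labeled cries" could look like, assuming a simple nearest-neighbor lookup. The feature vectors, the labels, and the function name translate are all hypothetical; a real system would first have to extract features from the recorded audio.

```python
import math

# Hypothetical labeled dataset: (feature vector, label) pairs.
# The numbers are invented for illustration only.
LABELED_CRIES = [
    ((0.90, 0.80), "pain"),
    ((0.85, 0.90), "pain"),
    ((0.30, 0.40), "hunger"),
    ((0.20, 0.50), "hunger"),
]


def translate(features, dataset=LABELED_CRIES):
    """Return the label of the closest labeled example (1-nearest neighbor)."""
    nearest = min(dataset, key=lambda item: math.dist(features, item[0]))
    return nearest[1]


if __name__ == "__main__":
    # A new recording, already reduced to the same two made-up features.
    print(translate((0.88, 0.82)))
```

The sketch also shows why the labels matter so much: the output can only ever be as good as the labeled examples it is compared against.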

Some apps, like Nanni AI and Cappella, have a translation feature within a larger “parent assistant” program which features monitors, sleep trackers, and feeding logs. ChatterBaby is a translator built out of research by UCLA’s institute for neuroscience. Others range from glorified white-noise apps to a parody tool that translates baby noises into quirky phrases you’d see on a onesie, like “don’t talk to me until I’ve had my bottle.”

These translators, often free to download, make money in a variety of ways, from premium-tier subscriptions (Cappella’s translator is free, but users who want to use milestone-tracking features pay $10/month) to research grants and more traditional funding (Nanni AI’s parent company, Ubenwa Health, received $2.5 million in funding in 2022).

For new parents, translating their baby’s cries has a natural draw as they look for any way to confirm what their nonverbal offspring needs or wants. Baby cries are evolutionarily designed to make humans stress out, and parents trying to learn their newborn’s “language” aren’t helped by sleep deprivation.

The teams behind AI baby translators seek to make the process of understanding what babies want simpler. Unlike larger AI models trained on — essentially — everything, AI baby translators are trained only on labeled audio recordings of infants crying. The quality of that foundational “Monsters, Inc.”-ian cry data is key to any given app’s reliability.

Quality cries are hard to come by

“We actually had to create fake cry detectors to weed out all of the adults pretending to be babies crying,” said Ariana Anderson, founder of ChatterBaby at UCLA. Anderson had a team of researchers analyze one available database of baby cries used by some translation apps (which claimed to analyze cries with 99% accuracy) and found that all the cries labeled as “gassy” were actually just some guy talking.

“As they always say in AI, ‘bad data, bad model,’” Apolline Deroche, founder of Cappella, said. She said the Cappella team made early mistakes, like collecting cries recorded by parents in homes and purchasing datasets from other baby-translation companies. “This one dataset that we bought, we realized that only 7% of it was actually baby cries. The other 93% was TV, background noise, and people talking.” 

Deroche said Cappella’s current cry-collection process is much more rigorous. Doctors and nurses at two partnering Bucharest hospitals record a cry, look for seven identifiers before labeling it, and then have a second nurse or doctor listen to confirm or reject the label before adding it to Cappella’s database.
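As a loose illustration of that two-step review, here is a minimal sketch in which a recording only enters the dataset once a second clinician confirms the first label. The Recording type, its field names, and the sample labels are assumptions made for the example, not Cappella's actual system, and the seven identifiers are left out because the article does not say what they are.

```python
from dataclasses import dataclass


@dataclass
class Recording:
    clip_id: str             # hypothetical identifier for the audio clip
    label: str               # label assigned by the clinician who recorded it
    second_confirmed: bool   # True if a second clinician confirmed the label


def accepted(recordings):
    """Keep only recordings whose label was confirmed by a second reviewer."""
    return [r for r in recordings if r.second_confirmed]


if __name__ == "__main__":
    batch = [
        Recording("clip-001", "pain", True),
        Recording("clip-002", "hunger", False),  # rejected on second listen
        Recording("clip-003", "tiredness", True),
    ]
    for r in accepted(batch):
        print(r.clip_id, r.label)
```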

There’s another core issue with the baby-translation business: babies learn fast. Deroche said Cappella’s baby-translation tech is reliable only until babies are six months old. Soon, users will be automatically downgraded after six months, losing the translation tool but retaining other features like monitoring and milestone tracking.

Charles Onu, founder of Ubenwa Health and creator of Nanni AI (which says it has analyzed 1.5 million cries from 140,000 users since the app launched this year), said the goal was to eventually go into hospitals commercially as a tool to aid in diagnosis.

No consensus on the legitimacy of baby translation 

How legit are baby translators? Good question.

Research dating to the 1960s seems to generally agree that, one, adults with lots of experience with babies (doctors, nurses, parents) are better at deciphering the meaning behind newborn cries than others, and, two, there’s a limit to how much meaning babies are passing along when they wail.

The most reputable cry translators keep their interpretations relatively simple, separating cries into categories like pain, hunger, tiredness, or discomfort. Some apps get more specific, providing translations like “earache” or “diaper change.” But Barry Lester, a professor at Brown University, colic expert, and author of “Why Is My Baby Crying?,” said that in his decades of research, only two kinds of baby cries have been identified reliably: pain cries and cries for everything else.

“This idea that a baby cries differently when they’re hungry, or bored, or sleepy, or any of that stuff, is just crap,” Lester said.

Lester has been studying infant cries for more than half a century and has developed an acoustic cry-analysis system. In his office, Lester has a collection of decades’ worth of bogus-baby-cry tech — he calls it his “cry museum.” From a tool covered in baby faces that lights up an appropriate face correlated to the cry type (Lester said it’s stupid and doesn’t work) to a product the FDA asked him to evaluate that plugs up a baby’s mouth to “absorb” loud cries (he did not give it his stamp of approval), Lester is deeply skeptical of any infant tech making bold claims.

ChatterBaby’s Anderson echoed that idea. ChatterBaby offers a limited batch of cry translations, and its 90% translation accuracy claim is specific to pain. Parents, Anderson said, should be wary of apps that promise too much.

“There's a big problem in this field where there's a lot of snake oil and bad science going on,” Anderson said. “It can do a lot more harm than good if we’re relying on an AI tool to tell us whether to feed our baby.”

Research has shown that, broadly, AI isn’t reliable at reading human emotions. A good test of a cry translator’s legitimacy is to see if it’s claiming to interpret emotions newborn babies can’t have yet.

“We’ll see some AI baby translators which will claim with a straight face ‘this baby is bored,’” Anderson said. “Well, cognitively, a baby is not able to be bored when they are zero to 3 months old. So if you have tools predicting things which cannot exist in newborns, you automatically know that it’s not based in science.”

For new parents frustrated by the limitations of baby translation and searching for help, Lester encourages trusting your intuition.

“Our species is pretty damn good at carrying on and reproducing, and parenting is built into us,” he said. He thinks these devices impede the parent-newborn relationship. “My advice to new parents is to pay attention to the baby’s signals and cues and try and figure out what the kid is saying. They can figure it out. You will figure it out.”

As for other baby sounds like gurgling and babbling, sorry, nobody knows what the hell they’re trying to say.

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.