DNYUZ

AI has already run out of training data — but there’s more waiting to be unlocked, Goldman’s data chief says

October 2, 2025

Business Insider

  • AI is already facing a data shortage, reshaping how new systems are built, Goldman Sachs data chief says.
  • Synthetic data is filling the gap, but it risks flooding models with low-quality output.
  • Proprietary datasets, such as the data businesses already hold, may be the key to closing the data gap.

The meteoric rise of artificial intelligence may appear unstoppable — but it’s facing a shortage of training data.

“We’ve already run out of data,” Neema Raphael, Goldman Sachs’ chief data officer and head of data engineering, said on the bank’s “Exchanges” podcast published on Tuesday.

Raphael said that this shortage may already be influencing how new AI systems are built.

He pointed to China’s DeepSeek as an example, saying one hypothesis for its purported development costs is that it was trained on the outputs of existing models rather than on entirely new data.

“I think the real interesting thing is going to be how previous models then shape what the next iteration of the world is going to look like in this way,” Raphael said.

With the web tapped out, developers are turning to synthetic data: machine-generated text, images, and code. That approach offers a limitless supply, but it also risks overwhelming models with low-quality output, often called AI slop.

However, Raphael said he doesn’t think the lack of fresh data will be a massive constraint, in part because companies are sitting on untapped reserves of information.

“I think from a consumer world model, I think it’s interesting we’ve definitely in the synthetic sort of explosion of data. But from an enterprise perspective, I think there’s still a lot of juice I’d say to be squeezed in that,” he said.

That means the real frontier may not be the open internet, but the proprietary datasets held by corporations. From trading flows to client interactions, firms like Goldman sit on information that could make AI tools far more valuable if harnessed correctly.

Raphael’s comments come as the industry grapples with “peak data,” a concern that has grown since the breakout of ChatGPT three years ago.

In January, OpenAI cofounder Ilya Sutskever said at a conference that all the useful data online had already been used to train models, warning that AI’s era of rapid development “will unquestionably end.”

The next frontier: proprietary data

For businesses, Raphael stressed, the obstacle isn’t just finding more data — it’s ensuring that the data is usable.

“The challenge is understanding the data, understanding the business context of the data, and then being able to normalize it in a way that makes sense for the business to consume it,” he said.

Still, Raphael suggested that heavy reliance on synthetic data raises a deeper question about AI’s trajectory. “I think what might be interesting is people might think there might be a creative plateau,” he said.

He wondered what would happen if models keep training only on machine-generated content.

“If all of the data is synthetically generated, then how much human data could then be incorporated?” he said.

“I think that’ll be an interesting thing to watch from a philosophical perspective,” he added.


The post AI has already run out of training data — but there’s more waiting to be unlocked, Goldman’s data chief says appeared first on Business Insider.


Copyright © 2025.
