DNYUZ

Sam Altman Says Oops, They Accidentally Made the New Version of ChatGPT Worse Than the Previous One

January 29, 2026
in News

It’s been a little over three years since the launch of the first commercially available large language model (LLM) chatbot, OpenAI’s ChatGPT. And though the AI model has certainly made performance gains since it came online, the lackluster performance of recent iterations hasn’t helped the perception that LLMs are hitting a plateau.

Case in point, OpenAI CEO Sam Altman recently conceded that the company had “screwed up” the language capabilities of its latest chatbot iteration, GPT-5.2.

“I think we just screwed that up,” Altman said at a developer town hall on Monday. “We will make future versions of GPT 5.x hopefully much better at writing than 4.5 was.”

Continuing, Altman said that the company chose to focus on ChatGPT’s technical capabilities, perhaps to the detriment of its human-language performance.

“We did decide, and I think for good reason, to put most of our effort in 5.2 into making it super good at intelligence, reasoning, coding, engineering, that kind of thing,” Altman said. “And we have limited bandwidth here, and sometimes we focus on one thing and neglect another.”

The admission raises a high-stakes question: whether frontier AI models can continue to excel at tasks across the board, or whether proficiency in one domain will start to come at the expense of a broader skill set.

As Search Engine Journal points out, the release of GPT-5.2 came with a huge emphasis on technical tasks like coding and formatting spreadsheets. Compared with past iterations, there was scant mention of writing or creative work, a pivot that has left many non-technical users feeling like ChatGPT is hitting a wall.

As data scientist and tech blogger Mehul Gupta pointed out in a review of GPT-5.2, there are plenty of signs that the LLM is backsliding, and some of them aren’t particularly subtle.

These include a “flatter tone,” worse translation capability, inconsistent behavior across tasks, and some major regression in “instant mode,” a setting meant to provide immediate answers to simple questions.

As Gupta writes, it also struggles with real-world tasks. When it comes to evaluating human documents like contracts, mixed-format notes or PDFs, GPT-5.2 “forgot earlier details, contradicted itself, misread cross-references, [and] hallucinated clarifications that didn’t exist.”

“Benchmarks are clean,” Gupta observed. “Real documents are not. 5.2 still struggles with the noise of reality.”

The post Sam Altman Says Oops, They Accidentally Made the New Version of ChatGPT Worse Than the Previous One appeared first on Futurism.


DNYUZ © 2025
