DNYUZ
No Result
View All Result
DNYUZ
No Result
View All Result
DNYUZ
Home News

Top AI Models Showing Disturbing Behavior as They Become More Advanced

May 24, 2026
in News
Top AI Models Showing Disturbing Behavior as They Become More Advanced

We’ve already seen AI go rogue on numerous occasions. Now, new research suggests that we can expect this to become the norm.

The AI research nonprofit Model Evaluation and Threat Research (METR) recently released a study conducted between February and March of this year, aimed at determining just how likely frontier AI models could go rogue. If you’re given to anxiety about the future of AI, the results are unlikely to make you feel better.

“Given rapidly advancing capabilities, we expect the plausible robustness of rogue deployments to increase substantially in the coming months,” the researchers wrote.

The research examined LLMs developed by OpenAI, Google, Anthropic, and Meta for the purpose of the study. They found that frontier AI systems are showing signs of disturbingly deceptive behavior as they become more advanced, often turned to verboten shortcuts or otherwise subverting their operators’ instructions — and some were even smart enough to try to cover their tracks.

In one instance, an internal frontier AI model from OpenAI was told to use specific software for an assigned task. Not only did the agent ignore the request, but it also injected a code to erase evidence of how it arrived at its conclusion — which did not involve use of that software.

In another test, an AI agent from Anthropic was caught “reward hacking.” This is when AI identifies loopholes that help it complete its assignment in a literal sense, even if it doesn’t produce the desired outcome. It should be noted that the programmer told the agent not to cheat or leverage any workarounds during its assignment — the model decided to do so all on its own.

The METR researchers behind the study do not believe there is reason for alarm just yet. For example, they don’t think any of these models is capable of hiding evidence of going rogue on a larger scale. However, they did issue a warning: without stronger security and monitoring, there is a stark risk of this becoming a reality.

“Based on this pilot assessment, we believe that agents as of February and March 2026 would not have had sufficient capability to hide a rogue deployment of significant scale against an active investigation by the company, or to make such a deployment robust to a high-priority effort by the company to shut it down,” the team wrote. “However, this risk could increase rapidly, and we see several reasons to expect the plausible robustness of rogue deployments to increase in the near future, absent stronger alignment, security, and monitoring.”

More on AI going rogue: Scientists Train AI to Be Evil, Find They Can’t Reverse It

The post Top AI Models Showing Disturbing Behavior as They Become More Advanced appeared first on Futurism.

Rubio Says Details on Iran Nuclear Program Still to Be Negotiated
News

Rubio Says Details on Iran Nuclear Program Still to Be Negotiated

by New York Times
May 24, 2026

Secretary of State Marco Rubio said the United States was prepared to enter “into very serious talks” about Iran’s nuclear ...

Read more
News

Legendary ‘West Wing’ president makes dramatic endorsement in must-win Senate race

May 24, 2026
News

Storms Expected Across Eastern U.S. Through Memorial Day

May 24, 2026
News

Ray J Gets Knocked Out in Celebrity Boxing Match

May 24, 2026
News

Fill Your Summer Calendar With These 9 Music Festivals

May 24, 2026
Nonprofit fraud isn’t surging. Enforcement is

Nonprofit fraud isn’t surging. Enforcement is

May 24, 2026
Top AI Models Showing Disturbing Behavior as They Become More Advanced

Top AI Models Showing Disturbing Behavior as They Become More Advanced

May 24, 2026
These Are 5 of the Main Issues to Be Resolved in an Iran-U.S. Peace Deal

These Are 5 of the Main Issues to Be Resolved in an Iran-U.S. Peace Deal

May 24, 2026

DNYUZ © 2026

No Result
View All Result

DNYUZ © 2026