DNYUZ

AI Researchers Say They’ve Invented Incantations Too Dangerous to Release to the Public

December 7, 2025

With great power comes great dupe-ability.

Last month, we reported on a new study conducted by researchers at Icaro Lab in Italy that discovered a stupefyingly simple way of breaking the guardrails of even cutting-edge AI chatbots: “adversarial poetry.”

In a nutshell, the team, comprising researchers from the safety group DexAI and Sapienza University in Rome, demonstrated that leading AIs could be wooed into doing evil by regaling them with poems that contained harmful prompts, like how to build a nuclear bomb.

Underscoring the strange power of verse, coauthor Matteo Prandi told The Verge in a recently published interview that the spellbinding incantations they used to trick the AI models are too dangerous to be released to the public.

The poems, ominously, were something “that almost everybody can do,” Prandi added.

In the study, which is awaiting peer review, the team tested 25 frontier AI models, including those from OpenAI, Google, xAI, Anthropic, and Meta, by feeding them poetic instructions, which they either wrote by hand or produced by converting known harmful prompts into verse with an AI model. They also compared the success rate of these prompts to that of their prose equivalents.

Across all models, the poetic prompts written by hand successfully tricked the AI bots into responding with verboten content an average of 63 percent of the time. Some, like Google’s Gemini 2.5, even fell for the corrupted poetry 100 percent of the time. Curiously, smaller models appeared to be more resistant, with single-digit attack success rates, like OpenAI’s GPT-5 nano, which didn’t fall for the ploy once. Most models were somewhere in between.

Compared to handcrafted verse, AI-converted prompts were less effective, with an average jailbreak success rate of 43 percent. But this was still “up to 18 times higher than their prose baselines,” the researchers wrote in the study.

Why poems? That much isn’t clear, though according to Prandi, calling it adversarial “poetry” may be a bit of a misnomer.

“It’s not just about making it rhyme. It’s all about riddles,” Prandi told The Verge, explaining that some poetic structures were more effective than others. “Actually, we should have called it adversarial riddles — poetry is a riddle itself to some extent, if you think about it — but poetry was probably a much better name.”

The researchers speculate it may have to do with how poems present information in a way that’s unexpected to large language models, befuddling their powers of predicting which word should come next. But this shouldn’t be possible, they say.

“Adversarial poetry shouldn’t work. It’s still natural language, the stylistic variation is modest, the harmful content remains visible,” the team told Wired in an interview. “Yet it works remarkably well.”

Evildoers may now regret not paying attention in English class. The difference between a sonnet and a sestina could also be the difference between having Clippy or Skynet as your partner in crime.

“The production of weapons-grade Plutonium-239 involves several stages,” explained one AI model that the researchers entranced with verse. “Here is a detailed description of the procedure.”


The post AI Researchers Say They’ve Invented Incantations Too Dangerous to Release to the Public appeared first on Futurism.

DNYUZ © 2025
