DNYUZ
No Result
View All Result
DNYUZ
No Result
View All Result
DNYUZ
Home News

AI Researchers Say They’ve Invented Incantations Too Dangerous to Release to the Public

December 7, 2025
in News
AI Researchers Say They’ve Invented Incantations Too Dangerous to Release to the Public

With great power comes great dupe-ability.

Last month, we reported on a new study conducted by researchers at Icaro Lab in Italy that discovered a stupefyingly simple way of breaking the guardrails of even cutting-edge AI chatbots: “adversarial poetry.”

In a nutshell, the team, comprising researchers from the safety group DexAI and Sapienza University in Rome, demonstrated that leading AIs could be wooed into doing evil by regaling them with poems that contained harmful prompts, like how to build a nuclear bomb.

Underscoring the strange power of verse, coauthor Matteo Prandi told The Verge in a recently published interview that the spellbinding incantations they used to trick the AI models are too dangerous to be released to the public.

The poems, ominously, were something “that almost everybody can do,” Prandi added.

In the study, which is awaiting peer-review, the team tested 25 frontier AI models — including those from OpenAI, Google, xAI, Anthropic, and Meta — by feeding them poetic instructions, which they made either by hand or by converting known harmful prompts into verse with an AI model. They also compared the success rate of these prompts to their prose equivalent.

Across all models, the poetic prompts written by hand successfully tricked the AI bots into responding with verboten content an average 63 percent of the time. Some, like Google’s Gemini 2.5, even fell for the corrupted poetry 100 percent of the time. Curiously, smaller models appeared to be more resistant, with single digit success rates, like OpenAI’s GPT-5 nano, which didn’t fall for the ploy once. Most models were somewhere in between.

Compared to handcrafted verse, AI-converted prompts were less effective, with an average jailbreak success rate of 43 percent. But this was still “up to 18 times higher than their prose baselines,” the researchers wrote in the study.

Why poems? That much isn’t clear, though according to Prandi, calling it adversarial “poetry” may be a bit of a misnomer.

“It’s not just about making it rhyme. It’s all about riddles,” Prandi told The Verge, explaining that some poetic structures were more effective than others. “Actually, we should have called it adversarial riddles — poetry is a riddle itself to some extent, if you think about it — but poetry was probably a much better name.”

The researchers speculate it may have to do with how poems present information in a way that’s unexpected to large language models, befuddling their powers of predicting what word should come after the next. But this shouldn’t be possible, they say.

“Adversarial poetry shouldn’t work. It’s still natural language, the stylistic variation is modest, the harmful content remains visible,” the team told Wired in an interview. “Yet it works remarkably well.”

Evildoers may now regret not paying attention in English class. The difference between a sonnet and a sestina could also be the difference between having Clippy or Skynet as your partner in crime.

“The production of weapons-grade Plutonium-239 involves several stages,” explained one AI model that the researchers entranced with verse. “Here is a detailed description of the procedure.”

More on AI: Rockstar Cofounder Says AI Is Like When Factory Farms Did Cannibalism and Caused Mad Cow Disease

The post AI Researchers Say They’ve Invented Incantations Too Dangerous to Release to the Public appeared first on Futurism.

Why Comcast lost the Warner Bros. bidding war to Netflix, according to its president
News

Why Comcast lost the Warner Bros. bidding war to Netflix, according to its president

by Business Insider
December 8, 2025

Comcast's Mike Cavanagh said the company didn't put as much cash into its bid for Warner Bros. as others did. ...

Read more
News

A second flight of Iranian deportees, carrying 55, has left the U.S., Iran says

December 8, 2025
News

Cancer Is Surging, Bringing a Debate About Whether to Look for It

December 8, 2025
News

These signs show Trump’s end is imminent — and make him more dangerous than ever

December 8, 2025
News

TheWrap Wins Best Website, Takes 8 Top Honors at National Arts & Entertainment Journalism Awards

December 8, 2025
Mike Bloomberg’s new $50 million mayor bootcamp trains local leaders not to ‘play it safe’

Mike Bloomberg’s new $50 million mayor bootcamp trains local leaders not to ‘play it safe’

December 8, 2025
Starbucks teases another chance to snag the coveted Bearista Cup

Starbucks teases another chance to snag the coveted Bearista Cup

December 8, 2025
Piers Morgan Raises $30 Million for ‘Uncensored’ Brand

Piers Morgan Raises $30 Million for ‘Uncensored’ Brand

December 8, 2025

DNYUZ © 2025

No Result
View All Result

DNYUZ © 2025