- AI models can be manipulated using poetic prompts to generate controversial content
- Study tested 25 chatbots, with an average 62% jailbreak success using poetic prompts
- Anthropic's chatbots resisted best; 13 of the 25 models had attack success rates above 70%
A new paper published by a team of cybersecurity researchers claims that artificial intelligence (AI) models can be manipulated into generating content about controversial topics simply by phrasing the prompt as a poem. The study, titled "Adversarial Poetry as a Universal Single-Turn Jailbreak in Large Language Models (LLMs)" and published on the preprint server arXiv, found that AI chatbots may even help you build a nuclear bomb if the prompt is poetic, according to Wired.
The researchers tested the poetic prompts on 25 chatbots from companies including OpenAI, Meta and Anthropic, where the technique worked with varying degrees of success on each.
"Poetic framing achieved an average jailbreak success rate of 62 percent for hand-crafted poems and approximately 43 percent for meta-prompt conversions," the study stated.
Anthropic's chatbots were the best at resisting the poetic attacks, but others fared much worse: 13 of the 25 models tested saw an Attack Success Rate (ASR) higher than 70 per cent with poetic prompts, while only five had an ASR below 35 per cent.
"We experimented by reformulating dangerous requests in poetic form, using metaphors, fragmented syntax, oblique references," one of the researchers told the outlet.
"The results were striking: success rates up to 90 per cent on frontier models. Requests immediately refused in direct form were accepted when disguised as verse."
To combat the issue, the researchers urged that safety evaluations focus on mechanisms that keep LLMs from producing harmful information regardless of how users phrase their requests.
"Without such mechanistic insight, alignment systems will remain vulnerable to low-effort transformations that fall well within plausible user behaviour but sit outside existing safety-training distributions."
Jailbreaking LLMs
This is not the first instance of AI models being jailbroken simply by manipulating the prompts. In June, researchers from Intel jailbroke LLMs using information overload: by cramming a prompt with hundreds of words of academic jargon, they forced the AI model to return results that were otherwise blocked by design.
In a nutshell, jailbreaking relies on cleverly designed prompts to bypass a chatbot's built-in restrictions and produce otherwise forbidden results. These techniques exploit the inherent conflict between the model's core directive, which is to be helpful and obey the user, and its secondary safeguards against generating harmful, biased, unethical, or illegal content.
By framing the question in ways that make “helpfulness” appear to override safety rules, the prompt tricks the model into prioritising the user's instructions over its protective restrictions.