PAST2HARM: A Simple Adaptive Past Tense Attack for Jailbreaking Multimodal AI

arXiv — cs.CLThursday, May 28, 2026 at 4:00:00 AM
  • What Happened

    The introduction of PAST2HARM marks a significant advancement in the field of multimodal AI, presenting a novel adaptive jailbreak framework that effectively circumvents existing safeguards in text-to-image models. This framework exploits the vulnerability of past tense reformulations to bypass refusal training, raising concerns about the potential for harmful image generation.

  • Why It Matters

    The implications of PAST2HARM are profound, as it highlights the inadequacies of current defenses against jailbreak attacks in AI systems, particularly in the context of image generation, which can have more severe consequences than text-based outputs.

  • The Bigger Picture

    This development underscores a growing trend in AI where the reliability of generated images is increasingly questioned, as advancements in models like GPT Image 2 and Nano Banana Pro shift focus from artistic creation to the production of synthetic visual evidence, amplifying the risks associated with misuse and misinformation.

— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about