AI News and Headlines
  • All Technology
  • AI
  • Autonomy
  • B2B Growth
  • Big Data
  • BioTech
  • ClimateTech
  • Consumer Tech
  • Cybersecurity
  • DevOps
  • Digital Marketing
  • Ecommerce
  • EdTech
  • Enterprise
  • FinTech
  • GovTech
  • Hardware
  • HealthTech
  • HRTech
  • LegalTech
  • Nanotech
  • PropTech
  • Quantum
  • Robotics
  • SaaS
  • SpaceTech
AllNewsDealsSocialBlogsVideosPodcastsDigests
HomeTechnologyAINewsAugustus v0.0.9: Multi-Turn Attacks for LLMs That Fight Back
Augustus v0.0.9: Multi-Turn Attacks for LLMs That Fight Back
AICybersecurity

Augustus v0.0.9: Multi-Turn Attacks for LLMs That Fight Back

•March 16, 2026
Security Boulevard – DevOps
Security Boulevard – DevOps•Mar 16, 2026

Why It Matters

Multi‑turn attacks expose a largely undefended surface, forcing LLM providers to extend safety beyond single‑turn filters. Organizations must evaluate conversational resilience to avoid data leakage and policy violations.

Key Takeaways

  • •Unified engine runs four distinct multi‑turn strategies
  • •Hydra can erase refused turns, diversifying tactics
  • •Crescendo reaches 0.80 score in just two turns
  • •GOAT achieves perfect score in a single turn
  • •Works across 28 providers, 172 probes, 43 generators

Pulse Analysis

The security community has long focused on single‑turn jailbreaks—simple prompts that trick a model into ignoring its policies. Modern guardrails now reject obvious tricks like “ignore previous instructions” or base64‑encoded payloads within milliseconds. However, these defenses often overlook the cumulative effect of a natural conversation, where each turn appears innocuous but together steer the model toward prohibited content. This shift from isolated prompts to contextual dialogue creates a blind spot that attackers can exploit, making multi‑turn testing essential for a realistic risk assessment.

Augustus v0.0.9 addresses that blind spot with a single binary that orchestrates attacker, target, and judge LLMs across any provider. Its four personalities illustrate different tactical philosophies: Crescendo escalates gently, GOAT attacks aggressively with chain‑of‑thought reasoning, Hydra rewrites refused turns to hide failures, and Mischievous User mimics a casual user to evade detection. A built‑in judge scores progress after each exchange, enabling automatic back‑tracking and technique diversification across twelve categories. The engine’s plug‑in architecture lets teams mix and match generators—OpenAI, Anthropic, Ollama, or custom REST endpoints—while leveraging 172 probes and 109 detectors for comprehensive coverage.

For enterprises deploying LLMs, the emergence of robust multi‑turn attack frameworks signals a need to rethink defensive postures. Traditional prompt‑filtering and refusal logging are insufficient when a model gradually builds context that appears legitimate. Security teams should incorporate continuous conversation monitoring, dynamic policy updates, and adversarial training that includes multi‑turn scenarios. As open‑source tools like Augustus lower the barrier to sophisticated red‑team exercises, vendors are likely to accelerate research into conversational safety nets, such as memory‑aware refusal mechanisms and real‑time intent verification, to protect against this evolving threat vector.

Augustus v0.0.9: Multi-Turn Attacks for LLMs That Fight Back

Read Original Article

Comments

Want to join the conversation?

Loading comments...

AI Pulse

EMAIL DIGESTS

Daily

Every morning

Weekly

Tuesday recap

Top Publishers

  • The Verge AI

    The Verge AI

    21 followers

  • TechCrunch AI

    TechCrunch AI

    19 followers

  • Crunchbase News AI

    Crunchbase News AI

    15 followers

  • TechRadar

    TechRadar

    15 followers

  • Hacker News

    Hacker News

    13 followers

See More →

Top Creators

  • Ryan Allis

    Ryan Allis

    194 followers

  • Elon Musk

    Elon Musk

    78 followers

  • Sam Altman

    Sam Altman

    68 followers

  • Mark Cuban

    Mark Cuban

    56 followers

  • Jack Dorsey

    Jack Dorsey

    39 followers

See More →

Top Companies

  • SaasRise

    SaasRise

    196 followers

  • Anthropic

    Anthropic

    39 followers

  • OpenAI

    OpenAI

    21 followers

  • Hugging Face

    Hugging Face

    15 followers

  • xAI

    xAI

    12 followers

See More →

Top Investors

  • Andreessen Horowitz

    Andreessen Horowitz

    16 followers

  • Y Combinator

    Y Combinator

    15 followers

  • Sequoia Capital

    Sequoia Capital

    12 followers

  • General Catalyst

    General Catalyst

    8 followers

  • A16Z Crypto

    A16Z Crypto

    5 followers

See More →
NewsDealsSocialBlogsVideosPodcasts