DORA Metrics Are Lying to You and AI Is Making It Worse

DORA Metrics Are Lying to You and AI Is Making It Worse

LeadDev (independent publication)
LeadDev (independent publication)May 6, 2026

Key Takeaways

  • DORA tracks flow, not developers' understanding of code.
  • AI‑generated code can inflate deployment frequency while hiding bugs.
  • Invisible dependencies increase blast radius despite low change‑failure rates.
  • Add scope notes and incident narratives to DORA reviews for critical systems.

Pulse Analysis

Since the 2010s, DORA’s four metrics—deployment frequency, lead time, change‑failure rate, and MTTR—have become the lingua franca for assessing software delivery velocity. By quantifying how quickly code moves from commit to production, they gave executives a clear, comparable view of engineering efficiency across teams and industries. The simplicity of the numbers encouraged widespread adoption, turning DORA dashboards into a staple of modern DevOps reporting and a proxy for team health.

The AI boom has upended that simplicity. Large‑language models now draft pull‑request code, write configuration files, and even suggest infrastructure changes. While this accelerates deployment frequency, it also reduces the human vetting layer that traditionally caught logical flaws. Teams can ship AI‑produced artifacts they cannot fully explain, inflating the DORA scores while the system’s legibility deteriorates. Hidden dependencies—such as third‑party identity providers or AI‑driven orchestration scripts—expand the blast radius of failures, meaning incidents may surface far downstream, far beyond what change‑failure rates capture.

To restore balance, organizations should treat DORA as a delivery gauge, not a health verdict. Embedding scope notes, dependency maps, and recent incident narratives into each DORA review forces a sanity check on system comprehension. Regular “explain‑in‑five‑minutes” drills and knowledge‑distribution metrics can surface concentration risks before attrition hits. By coupling quantitative flow data with qualitative legibility assessments, leaders gain a more truthful picture of risk, enabling better governance, regulatory compliance, and ultimately, more resilient software ecosystems.

DORA metrics are lying to you and AI is making it worse

Comments

Want to join the conversation?