Why Kubernetes Utilization Is Stuck Below 40%

Techstrong TV (DevOps.com)
Techstrong TV (DevOps.com)Jun 5, 2026

Why It Matters

Persistent underutilization drives massive wasted cloud spend and risks resource shortages for AI workloads; automated, platform-level optimization is increasingly necessary to control costs and preserve reliability.

Summary

Kubernetes clusters routinely run at low utilization—often below 30–40%—because developers overprovision resources out of fear of outages and current monitoring tools leave too much manual work. The panel argues this is not primarily a developer fault but a systemic issue: tuning is continuous, complex, and beyond human scale amid rising deployment velocity and AI-driven workloads. Traditional chargeback models and dashboards haven’t solved it; vendors like Perfect Scale advocate autonomous optimization algorithms to safely reduce waste while avoiding underprovisioning. Adoption is nascent but accelerating as cost pressure and scarcity of GPUs and memory make manual approaches untenable.

Original Description

Kubernetes was supposed to scale itself — so why do real-world clusters rarely run above 30–40% utilization? In this TechStrong TV interview, PerfectScale by DoiT CTO and Co-Founder Eli Birger joins Mike Vizard for a candid conversation about the gap between Kubernetes' promise and what's actually happening in production. Eli unpacks why developers' fear of failure leads to chronic over-provisioning, why monitoring tools were never built to solve this, how AI-driven vibe coding is exploding cluster costs, and why autonomous, battle-tested algorithms — not AI black boxes — are the safest path forward. Plus: keep-it-simple advice, the case for platform engineering, and how to think about Day 2 operations as a continuous optimization process.
In this conversation, Marcin and Alan cover:
• Why Kubernetes utilization rarely climbs above 30–40% in real-world clusters
• How developer fear of crashes leads to chronic over-provisioning
• Why monitoring tools were never designed to solve this problem
• The AI vibe coding boom and its impact on cluster costs
• Why autonomous, battle-proven algorithms beat AI black boxes for production
• The case for platform engineering and Day 2 continuous optimization
Chapters:
00:00 Introduction
00:25 Why Kubernetes utilization stays below 40%
02:30 Why developers can't solve over-provisioning alone
04:15 The AI workload surge and rising cluster costs
06:00 Rethinking how IT teams are organized
07:45 Why automation adoption has lagged
09:30 Cloud cost reckoning after 2023
10:45 Where AI fits — and where battle-proven algorithms win
13:00 Keep it simple: advice for Kubernetes teams
14:30 Has Kubernetes forced the platform engineering conversation?
16:45 Day 0, Day 1, Day 2 — continuous optimization mindset
18:45 Closing
Guest: Eli Birger, CTO & Co-Founder, PerfectScale by DoiT — https://www.perfectscale.io
Host: Mike Vizard, TechStrong Group
Subscribe to TechStrong TV for more interviews with the leaders shaping enterprise tech.
#Kubernetes #FinOps #PlatformEngineering #CloudCost #DevOps

Comments

Want to join the conversation?

Loading comments...