I Tested 30 DevOps Tasks with AI to See if AI Can Replace DevOps.

•February 9, 2026

0

Abhishek Veeramalla

Abhishek Veeramalla•Feb 9, 2026

Why It Matters

The findings highlight that AI can accelerate routine DevOps work but still demands expert supervision, underscoring security and reliability risks before enterprises can rely on LLMs for end‑to‑end pipeline automation.

Key Takeaways

•AI models can generate full DevOps pipelines but need human oversight.
•LLMs often use deprecated images, creating security vulnerabilities in deployments.
•Complex tasks like Argo Rollouts cause repeated errors and extensive retries.
•Claude Opus 4.6 required 55 minutes, far exceeding manual 25‑minute benchmark.
•Testing multiple models shows inconsistent handling of Kubernetes manifests and configs.

Summary

The video documents a two‑day experiment where creator Abishank evaluated 20‑30 real‑world DevOps tasks—ranging from beginner to advanced—using several popular large language models (LLMs). He leveraged GitHub Copilot’s ability to switch among models such as Anthropic’s Opus 4.6, OpenAI’s Sonnet 4.5, and Grok 3, running each through a full pipeline: creating a hello‑world Go app, provisioning a Kind cluster, installing Argo CD, and configuring progressive rollouts with Argo Rollouts.

Results revealed that while the models could generate complete manifests and scripts, they frequently introduced problems. Opus 4.6 produced a Dockerfile based on a deprecated Golang version, failed to create the Kind cluster on first attempt, and repeatedly mishandled Argo Rollout CRDs, leading to broken services and misleading success messages. Similar inconsistencies appeared with other models, requiring the tester to intervene, correct deprecated resources, and manually troubleshoot label‑selector mismatches in canary deployments.

Specific examples underscore the shortcomings: the agent claimed a successful canary rollout despite all traffic hitting the original version, and it generated overly complex shell scripts to verify rollouts instead of simple curl checks. Even after multiple retries, the model often proceeded without reporting critical errors, such as CRD installation failures, leaving the operator to diagnose and fix issues.

The experiment concludes that current LLMs can automate routine DevOps steps but cannot replace skilled engineers. Human oversight remains essential for security hygiene, error detection, and nuanced configuration decisions. Organizations considering AI‑driven CI/CD pipelines must factor in the extra validation overhead and potential security risks associated with outdated dependencies and silent failures.

Original Description

Join Membership for Career Guidance:

www.youtube.com/abhishekveeramalla/join

Can AI reply DevOps, Let's put an end to this question as I tested 30 most common DevOps day to day activities with AI. What I found is interesting, Check out the complete video for detailed analysis.

Free Course on the channel

==============================

- DevOps Zero to Hero Playlist: https://www.youtube.com/playlist?list=PLdpzxOOAlwvIKMhk8WhzN1pYoJ1YU8Csa

- AWS Zero to Hero Playlist: https://www.youtube.com/playlist?list=PLdpzxOOAlwvLNOxX0RfndiYSt1Le9azze

- Azure Zero to Hero Playlist: https://www.youtube.com/playlist?list=PLdpzxOOAlwvIcxgCUyBHVOcWs0Krjx9xR

- Terraform Zero to Hero Playlist: https://www.youtube.com/playlist?list=PLdpzxOOAlwvI0O4PeKVV1-yJoX2AqIWuf

- Python for DevOps Playlist: https://www.youtube.com/playlist?list=PLdpzxOOAlwvKwTyYNJCUwGPvql0TrsPgv

About me:

========

Instagram: https://www.instagram.com/abhishekveeramalla_official/

Telegram Channel : https://t.me/abhishekveeramalla

LinkedIn: https://www.linkedin.com/in/abhishek-veeramalla

GitHub: https://github.com/iam-veeramalla

Medium: https://abhishekveeramalla-av.medium.com/

Disclaimer: Unauthorized copying, reproduction, or distribution of this video content, in whole or in part, is strictly prohibited. Any attempt to upload, share, or use this content for commercial or non-commercial purposes without explicit permission from the owner will be subject to legal action. All rights reserved.

0

Comments

Want to join the conversation?

Loading comments...