Kubernetes Serverless Without the Vendor Lock-In (Here's How)

The DevOps Toolkit (Viktor Farcic)
The DevOps Toolkit (Viktor Farcic)Mar 30, 2026

Why It Matters

It shows enterprises how to cut idle infrastructure costs and avoid vendor lock‑in by implementing true zero‑to‑many scaling on standard Kubernetes, while highlighting the trade‑offs of cold‑start latency and added complexity.

Key Takeaways

  • Kubernetes can scale to zero without vendor‑locked serverless platforms.
  • KDA plus Envoy Gateway and Prometheus enable auto‑scaling.
  • Crossplane provisions full cluster stack across AWS, GCP, Azure automatically.
  • HTTP interceptor holds requests during cold starts, preventing loss.
  • Trade‑offs include cold‑start latency and added buffering complexity.

Summary

The video demonstrates how to achieve true serverless behavior—automatic scaling to zero and back—using plain Kubernetes rather than proprietary services like AWS Lambda. By combining Crossplane, Envoy Gateway, KEDA (referred to as KDA), Prometheus, and a pod‑monitor, the author builds a self‑contained platform that provisions the entire stack on any cloud provider and wires the components together without vendor lock‑in.

Key technical steps include defining a minimal cluster spec, letting Crossplane create the underlying infrastructure and install the required system apps, and then deploying an example workload. The demo progresses from a single static replica, to KEDA‑driven autoscaling based on Prometheus‑collected request metrics, and finally to a min‑replica setting of zero. An HTTP interceptor added by KEDA’s HTTP add‑on buffers incoming requests while pods spin up, ensuring no request loss during cold starts.

During the live test, 100,000 requests at 200 RPS are sent repeatedly. With static scaling, one pod handles the load; with KEDA autoscaling, the system expands to the configured maximum of five pods and contracts back to the minimum when traffic subsides. When the minimum is zero, the interceptor holds the first request, triggers a scale‑up, and then forwards the buffered request, confirming zero‑loss behavior despite a brief cold‑start delay.

The approach offers cost efficiency and multi‑cloud flexibility, but introduces cold‑start latency and the need for request buffering. Organizations can thus adopt a serverless‑like model on Kubernetes without surrendering control to a single cloud vendor, balancing operational simplicity against the modest overhead of an extra gateway and interceptor layer.

Original Description

This video demonstrates how to implement true scale-to-zero capabilities in Kubernetes without relying on specialized serverless platforms like Knative or AWS Lambda. Using standard Kubernetes components—Envoy Gateway, KEDA, Prometheus, and the KEDA HTTP Add-on—the tutorial shows how to build a system that automatically scales applications from zero to multiple replicas based on actual traffic demand, without losing any incoming requests.
The demonstration progresses through three stages: starting with a static single-replica deployment, adding Prometheus-based autoscaling that adjusts between one and five replicas based on request rates, and finally achieving complete scale-to-zero functionality. The key innovation is using KEDA's HTTP interceptor to hold requests in memory during cold starts while pods spin up, ensuring zero request loss even when scaling from nothing. The video also covers using Crossplane to provision fully configured clusters with all necessary components pre-installed. While acknowledging trade-offs like cold-start latency and interceptor memory limits, the approach offers significant resource efficiency by keeping idle applications at zero replicas, all while maintaining standard Kubernetes primitives and avoiding vendor lock-in.
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Sponsor: Kilo Code
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#Kubernetes #ScaleToZero #KEDA
Consider joining the channel: https://www.youtube.com/c/devopstoolkit/join
▬▬▬▬▬▬ 🔗 Additional Info 🔗 ▬▬▬▬▬▬
🔗 Crossplane: https://crossplane.io
▬▬▬▬▬▬ 💰 Sponsorships 💰 ▬▬▬▬▬▬
If you are interested in sponsoring this channel, please visit https://devopstoolkit.live/sponsor for more information. Alternatively, feel free to contact me over Twitter or LinkedIn (see below).
▬▬▬▬▬▬ 👋 Contact me 👋 ▬▬▬▬▬▬
▬▬▬▬▬▬ 🚀 Other Channels 🚀 ▬▬▬▬▬▬
▬▬▬▬▬▬ ⏱ Timecodes ⏱ ▬▬▬▬▬▬
00:00 Serverless in Kubernetes
01:35 Kilo Code (sponsor)
03:05 Kubernetes Scale-to-Zero Ready Cluster
07:45 From Static Replicas to Scale-to-Zero
16:48 Kubernetes Serverless Pros and Cons

Comments

Want to join the conversation?

Loading comments...