The New Stack

The New Stack

Publication
0 followers

DevOps, open source, and cloud native news with resources and insights for developers

Vultr Says Its Nvidia-Powered AI Infrastructure Costs 50% to 90% Less than Hyperscalers
NewsApr 3, 2026

Vultr Says Its Nvidia-Powered AI Infrastructure Costs 50% to 90% Less than Hyperscalers

Vultr announced an Nvidia‑powered AI infrastructure that it says costs 50% to 90% less than comparable offerings from major hyperscalers. The service lets platform engineering teams train AI agents on internal security, networking and compliance policies, then expose those as...

By The New Stack
Digital Experience Monitoring Belongs in the Modern Developer Workflow
NewsApr 3, 2026

Digital Experience Monitoring Belongs in the Modern Developer Workflow

Digital Experience Monitoring (DEM) is reshaping observability by tying frontend performance and real‑user outcomes to backend telemetry. The article explains how DEM integrates synthetic testing, Core Web Vitals, and crash data into developers' daily workflow, from CI/CD pipelines to incremental...

By The New Stack
The Hidden Reason Your AI Assistant Feels so Sluggish
NewsApr 3, 2026

The Hidden Reason Your AI Assistant Feels so Sluggish

AI assistants feel sluggish because agent‑driven workloads overload traditional data platforms. Large language models fire dozens of concurrent short SQL queries per prompt, demanding sub‑second response times that batch‑oriented warehouses can’t meet. This mismatch drives latency and rising costs, prompting...

By The New Stack
Why Broadcom Gave Velero to the CNCF Sandbox — and What It Means for Kubernetes Data Protection
NewsApr 2, 2026

Why Broadcom Gave Velero to the CNCF Sandbox — and What It Means for Kubernetes Data Protection

Broadcom has transferred ownership of the Velero backup and recovery project to the CNCF Sandbox, moving governance away from its VMware unit. The donation aims to eliminate perceived proprietary control and encourage broader community contributions. Broadcom positions this move as...

By The New Stack
OpenClaw Vs. Hermes Agent: The Race to Build AI Assistants that Never Forget
NewsApr 2, 2026

OpenClaw Vs. Hermes Agent: The Race to Build AI Assistants that Never Forget

Developers are frustrated by AI coding assistants that lose context after each session, prompting a split in the AI agent market between short‑lived tools and always‑on agents. OpenClaw and Hermes Agent represent the latter, offering persistent runtimes that remember past...

By The New Stack
Edge-Forward: Akamai Eyes Sweet Spot Between Centralized & Decentralized AI Inference
NewsApr 1, 2026

Edge-Forward: Akamai Eyes Sweet Spot Between Centralized & Decentralized AI Inference

Akamai is shifting from a pure CDN to a developer‑centric cloud platform that blends centralized data centers with a massive edge network for AI inference. The company leverages 41 core datacenters and roughly 4,400 smaller edge locations to run managed...

By The New Stack
Why Cursor Is Bringing Self-Hosted AI Agents to the Fortune 500
NewsMar 31, 2026

Why Cursor Is Bringing Self-Hosted AI Agents to the Fortune 500

Cursor is launching self‑hosted cloud agents that let Fortune 500 firms run its AI coding assistants inside their own infrastructure, keeping source code, tests and build artifacts on‑premise. The move tackles security, compliance and latency concerns that have limited enterprise...

By The New Stack
Portkey Open-Sources Its AI Gateway After Processing 2 Trillion Tokens a Day
NewsMar 31, 2026

Portkey Open-Sources Its AI Gateway After Processing 2 Trillion Tokens a Day

Portkey has released its unified Portkey Gateway as an open‑source project after the service began processing two trillion tokens per day and handling more than 120 million AI requests. The platform currently supports $180 million of annualized AI spend across roughly 24,000...

By The New Stack
Claude Code Users Say They’re Hitting Usage Limits Faster than Normal
NewsMar 31, 2026

Claude Code Users Say They’re Hitting Usage Limits Faster than Normal

Anthropic has confirmed that Claude Code users are exhausting their usage limits far faster than before, with some prompts consuming up to 10% of a monthly quota and simple greetings using 2% of a session. Users on the $100‑per‑month plan...

By The New Stack
AI Accelerates Modernization, but Don’t Leave Human Devs Behind
NewsMar 31, 2026

AI Accelerates Modernization, but Don’t Leave Human Devs Behind

AI-driven tools are reshaping application modernization by scanning, summarizing, and refactoring legacy code far faster than human‑only teams. The market reacted when Anthropic announced Claude Code could modernize COBOL, prompting a sharp IBM stock dip. While AI accelerates repetitive tasks,...

By The New Stack
Microsoft’s Copilot Makes Anthropic’s Claude and OpenAI’s GPT Team Up
NewsMar 30, 2026

Microsoft’s Copilot Makes Anthropic’s Claude and OpenAI’s GPT Team Up

Microsoft has integrated Anthropic’s Claude and OpenAI’s GPT into Copilot’s Researcher agent, adding a ‘critique’ step where GPT drafts content and Claude reviews it for accuracy and citation integrity. The combined workflow achieved a 57.4 score on Perplexity’s DRACO benchmark,...

By The New Stack
WebAssembly Is Now Outperforming Containers at the Edge
NewsMar 29, 2026

WebAssembly Is Now Outperforming Containers at the Edge

WebAssembly’s emerging Component Model 1.0 is poised to eclipse containers for edge and serverless workloads by delivering millisecond‑level code deployment and superior isolation. Recent talks at Wasm I/O highlighted Preview 3, which adds async functions, lazy APIs, and concurrency primitives, moving...

By The New Stack
How Platform Teams Are Eliminating a $43,800 “Hidden Tax” On Kubernetes Infrastructure
NewsMar 28, 2026

How Platform Teams Are Eliminating a $43,800 “Hidden Tax” On Kubernetes Infrastructure

Platform teams are tackling a hidden $43,800 annual tax caused by provisioning separate managed Kubernetes control planes for each tenant. A single Amazon EKS control plane costs about $0.10 per hour, which scales linearly with the number of clusters. Virtual‑cluster...

By The New Stack
Build It Yourself: A Data Pipeline that Trains a Real Model
NewsMar 28, 2026

Build It Yourself: A Data Pipeline that Trains a Real Model

The article explains what a data pipeline is, why it’s essential for AI, and provides a step‑by‑step tutorial to build a simple pipeline that simulates temperature data, trains a linear regression model with scikit‑learn, and generates predictions. It outlines the...

By The New Stack
Gitleaks Creator Returns with Betterleaks, an Open Source Secrets Scanner for the Agentic Era
NewsMar 27, 2026

Gitleaks Creator Returns with Betterleaks, an Open Source Secrets Scanner for the Agentic Era

The creator of the popular secret‑scanning tool Gitleaks has launched Betterleaks, an open‑source scanner designed as a drop‑in replacement with faster performance and more flexible validation. Backed by AI‑focused security startup Aikido, Betterleaks swaps hard‑coded entropy checks for CEL‑based rules...

By The New Stack
AI Can Write Your Infrastructure Code. There’s a Reason Most Teams Won’t Let It.
NewsMar 20, 2026

AI Can Write Your Infrastructure Code. There’s a Reason Most Teams Won’t Let It.

Spacelift co‑founder Marcin Wyszynski says AI is now writing infrastructure‑as‑code in HCL, eliminating the need for developers to hand‑craft Terraform or OpenTofu configurations. While this speeds provisioning, it creates a comprehension gap that can lead to dangerous production changes. Spacelift’s...

By The New Stack
Why Flat Kubernetes Networks Fail at Scale
NewsMar 20, 2026

Why Flat Kubernetes Networks Fail at Scale

Flat Kubernetes networking models work for small clusters but break at scale. As policies proliferate, the lack of hierarchy leads to unpredictable rule precedence and debugging challenges. Introducing security hierarchies—platform, security, and application tiers—adds explicit ordering and aligns with Zero...

By The New Stack
Linux Kernel Scale Is Swamping an Already-Flawed CVE System
NewsMar 20, 2026

Linux Kernel Scale Is Swamping an Already-Flawed CVE System

The Linux kernel became a CVE Numbering Authority in 2024, prompting a policy shift that assigns identifiers to virtually every defect. In 2025 the kernel topped vulnerability lists with over 48,000 CVEs, flooding security feeds with low‑impact and theoretical issues...

By The New Stack
Jellyfish AI Development Study: The Real Sting Has yet to Land
NewsMar 19, 2026

Jellyfish AI Development Study: The Real Sting Has yet to Land

Jellyfish’s AI Engineering Trends study, covering 700+ companies, 200,000 engineers and 20 million pull requests, shows that more than half of firms now use AI coding tools daily and 64 % generate the majority of their code with AI assistance. Teams in...

By The New Stack
The AI Revolution Will Be Open-Sourced
NewsMar 18, 2026

The AI Revolution Will Be Open-Sourced

At CES 2026 NVIDIA CEO Jensen Huang argued that AI will only scale when open innovation extends to infrastructure. Kubernetes, long used for AI, is gaining first‑class support through Dynamic Resource Allocation (DRA) that reached GA in version 1.34. New...

By The New Stack
Capital One Deprecated an AI Tool It Once Championed. Its DevEx Chief Says That’s the Point.
NewsMar 18, 2026

Capital One Deprecated an AI Tool It Once Championed. Its DevEx Chief Says That’s the Point.

Capital One’s developer experience (DevEx) team, led by SVP Catherine McGarvey, recently retired an AI‑driven ticket‑assignment tool after engineers expressed dissatisfaction. The group emphasizes "enablement"—providing the right tools, knowledge, and feedback—to boost productivity across its 14,000 technologists. AI tooling is...

By The New Stack
Why Your Observability Bill Keeps Growing (and It’s Not Your Vendor’s Fault)
NewsMar 18, 2026

Why Your Observability Bill Keeps Growing (and It’s Not Your Vendor’s Fault)

Observability spend is exploding across large engineering orgs, not because of vendor pricing but due to unchecked telemetry generation. Companies report monthly bills exceeding $200,000 while most data lacks proper service attribution and often contains leaked credentials. Auto‑instrumentation and high‑cardinality...

By The New Stack
Chainguard Thinks Most DevOps Teams Are Solving Container Security the Hard Way
NewsMar 17, 2026

Chainguard Thinks Most DevOps Teams Are Solving Container Security the Hard Way

Chainguard unveiled OS Packages, a beta service that lets DevOps teams assemble custom container images from zero‑CVE, source‑built packages. The offering leverages Chainguard’s Factory 2.0 pipeline to continuously rebuild over 30,000 enterprise‑grade packages and generate SBOMs automatically. Teams can use...

By The New Stack
SurePath AI Advances MCP Policy Controls to Tighten the Cable on AI’s USB-C
NewsMar 12, 2026

SurePath AI Advances MCP Policy Controls to Tighten the Cable on AI’s USB-C

SurePath AI unveiled MCP Policy Controls, a real‑time governance layer for Model Context Protocol (MCP) interactions. The service monitors and restricts which MCP servers and tools a codebase may use, automatically blocking or allowing payloads based on policy. In a...

By The New Stack
Runpod Report: Qwen Has Overtaken Meta’s Llama as the Most-Deployed Self-Hosted LLM
NewsMar 12, 2026

Runpod Report: Qwen Has Overtaken Meta’s Llama as the Most-Deployed Self-Hosted LLM

Runpod’s State of AI report, built on anonymized serverless deployment logs, shows Alibaba Cloud’s Qwen family has become the most‑deployed self‑hosted large language model, overtaking Meta’s Llama. Despite heavy marketing, Llama 4 registers near‑zero adoption, indicating developers prioritize performance per dollar,...

By The New Stack
Galileo Releases Agent Control, a Centralized Guardrails Platform for Enterprise AI Agents
NewsMar 11, 2026

Galileo Releases Agent Control, a Centralized Guardrails Platform for Enterprise AI Agents

Galileo unveiled Agent Control, an open‑source Apache‑2.0 control plane that lets enterprises enforce unified guardrails across AI agents. The platform lets developers write policies once and apply them in real time without downtime, and it ships with SDKs for easy...

By The New Stack
Tetrate Launches Open Source Marketplace to Simplify Envoy Adoption
NewsMar 11, 2026

Tetrate Launches Open Source Marketplace to Simplify Envoy Adoption

Tetrate has introduced Built on Envoy, a free, open‑source marketplace that bundles ready‑to‑use Envoy extensions. The platform addresses common adoption hurdles such as security integration, authentication, and AI governance by providing pre‑built modules for WAF, OAuth2, SAML, and content‑safety checks....

By The New Stack
Publish Your Data, AI Techniques, and Agentic Engineering Work on Towards Data Science
NewsMar 11, 2026

Publish Your Data, AI Techniques, and Agentic Engineering Work on Towards Data Science

The New Stack is inviting its readers to publish on Towards Data Science, one of the largest AI‑focused publications. Submissions are free, and accepted pieces receive editorial support, rapid two‑day turnaround, and promotion across TDS’s channels. Authors can earn money...

By The New Stack
How to Deploy an AI Server on Your Debian/Ubuntu Server
NewsMar 10, 2026

How to Deploy an AI Server on Your Debian/Ubuntu Server

The article walks through deploying a private AI server on Debian or Ubuntu using Ollama and Docker. It starts by adding the user to the sudo and Docker groups, then installs Ollama, pulls the llama3.2 model, and configures it for...

By The New Stack
How Context Rot Drags Down AI and LLM Results for Enterprises, and How to Fix It
NewsMar 9, 2026

How Context Rot Drags Down AI and LLM Results for Enterprises, and How to Fix It

Enterprises deploying AI agents and large language models are increasingly hampered by “context rot,” where stale or conflicting data floods the model’s limited attention window, degrading accuracy and causing hallucinations. As token volumes swell, LLMs exhaust their context budgets, leading...

By The New Stack
Moving AI Apps From Prototype to Production Requires Enterprise-Grade Postgres Infrastructure
NewsMar 9, 2026

Moving AI Apps From Prototype to Production Requires Enterprise-Grade Postgres Infrastructure

AI adoption surged to 78% of organizations in 2024, yet most initiatives remain prototypes. A new Apptio survey shows 90% of tech leaders can’t measure AI ROI, highlighting the gap between experimentation and production. Traditional databases lack vector search and...

By The New Stack
AI Coding Agents Can Write Code, Crafting Wants to Help Them Ship It
NewsMar 9, 2026

AI Coding Agents Can Write Code, Crafting Wants to Help Them Ship It

AI coding agents can generate code, but enterprise teams need testing and deployment in production-like environments. Crafting, a San Francisco startup founded by ex‑Google, Meta, Uber and Discord leaders, announced the general availability of Crafting for Agents, a platform that...

By The New Stack
Snowflake Cortex Code CLI Adds Dbt and Apache Airflow Support for AI-Powered Data Pipelines
NewsMar 8, 2026

Snowflake Cortex Code CLI Adds Dbt and Apache Airflow Support for AI-Powered Data Pipelines

Snowflake has expanded its Cortex Code CLI, an AI‑driven coding agent, to support the open‑source data‑pipeline frameworks dbt and Apache Airflow. The extension leverages Anthropic’s Agent Skills to automate debugging, testing, and optimization of pipelines, and is offered through a new...

By The New Stack
Claude’s Free Plan Can Now Remember You
NewsMar 2, 2026

Claude’s Free Plan Can Now Remember You

Anthropic has extended Claude's memory feature to its free tier, allowing users to import conversation histories from other AI services. The memory tool automatically captures context, can be edited manually, and can be disabled for privacy. Free‑plan adoption has surged...

By The New Stack
Most Platform Teams Build Products, but They Don’t Know It
NewsFeb 24, 2026

Most Platform Teams Build Products, but They Don’t Know It

Platform teams often treat internal platforms as pure infrastructure, overlooking their product nature. By failing to define specific user personas, they ship technically complete features that see low adoption. The article stresses that rollout activities differ from genuine adoption, which...

By The New Stack
Cloudflare’s Markdown for Agents Automatically Make Websites Agent-Ready
NewsFeb 22, 2026

Cloudflare’s Markdown for Agents Automatically Make Websites Agent-Ready

Cloudflare introduced “Markdown for Agents,” an edge service that converts HTML pages to Markdown when an AI agent requests them via an Accept: text/markdown header. The conversion can slash token consumption by up to 80%, turning a 16,180‑token HTML page...

By The New Stack
Why the Era of Relying on Dozens of “Purpose-Built” Databases Is Finally Coming to an End
NewsFeb 20, 2026

Why the Era of Relying on Dozens of “Purpose-Built” Databases Is Finally Coming to an End

Enterprises are shifting from fragmented, purpose‑built databases to unified operational data platforms that prioritize memory‑first architectures and AI‑ready features. The new platforms deliver sub‑millisecond response times, reduce infrastructure complexity, and cut total cost of ownership by up to 60%. By...

By The New Stack
GitLab CEO on Why AI Isn’t Helping Enterprise Ship Code Faster
NewsFeb 10, 2026

GitLab CEO on Why AI Isn’t Helping Enterprise Ship Code Faster

GitLab CEO Bill Staples says AI coding assistants haven’t accelerated enterprise software delivery because developers spend only 10‑20% of their day writing code. The remaining 80‑90% involves reviews, pipeline runs, security and compliance checks that remain untouched by AI. GitLab’s...

By The New Stack
Is the SaaSpocalypse Nigh? The Era of Paying for Software Seats May Be Ending.
NewsFeb 6, 2026

Is the SaaSpocalypse Nigh? The Era of Paying for Software Seats May Be Ending.

Anthropic’s Cowork plugins, an AI‑agent extension suite, sparked a sharp market sell‑off in legacy SaaS firms, with legal‑tech stocks falling 10‑16%. The reaction validates Satya Nadella’s 2024 warning that SaaS applications could collapse as AI agents absorb business logic. By...

By The New Stack
Focus on ‘Don’ts’ to Build Systems that Know when to Say ‘No’
NewsJan 21, 2026

Focus on ‘Don’ts’ to Build Systems that Know when to Say ‘No’

The article argues that AI knowledge bases must go beyond listing policies and include explicit "don’ts" to prevent harmful behavior. By integrating negative examples, decision‑logic trees, and relational knowledge graphs, organizations can give agents contextual guardrails and judgment akin to...

By The New Stack
AI Agents or Skills? Why the Answer Is ‘Both’
NewsJan 20, 2026

AI Agents or Skills? Why the Answer Is ‘Both’

Anthropic’s new Agent Skills introduce modular, declarative bundles of expertise that agents can load on demand, addressing prompt bloat and context‑window limits. By separating decision‑making logic (agents) from reusable procedural knowledge (skills), developers can extend capabilities without rewriting core agents....

By The New Stack
How To Build Production-Ready AI Agents With RAG and FastAPI
NewsJan 20, 2026

How To Build Production-Ready AI Agents With RAG and FastAPI

The New Stack tutorial outlines a production‑ready stack for building agentic AI using Retrieval‑Augmented Generation (RAG) and FastAPI. It combines a LangChain‑style reasoning loop, FAISS vector search, schema‑based guardrails, and token‑metered cost controls. The guide adds async execution, retries, semantic...

By The New Stack
Astro Redesigns Its Development Server
NewsJan 17, 2026

Astro Redesigns Its Development Server

Astro, now under Cloudflare, released the first beta of Astro 6, introducing a redesigned development server built on Vite’s Environment API. The server runs web applications in the same JavaScript engine used in production, delivering true runtime parity across Node and...

By The New Stack
The Future of AI in SRE: Preventing Failures, Not Fixing Them
NewsJan 17, 2026

The Future of AI in SRE: Preventing Failures, Not Fixing Them

The article outlines how site reliability engineering (SRE) is evolving from reactive alerting toward AI‑driven preventative reliability. Early AI stages improved triage and auto‑remediation, but still required a failure to occur. By mining years of incident post‑mortems, logs, and topology...

By The New Stack
Orchestration: The Key to Integrating AI with Legacy Systems
NewsJan 16, 2026

Orchestration: The Key to Integrating AI with Legacy Systems

Enterprises are racing to embed AI while still relying on legacy applications that run core operations. A recent MIT‑NANDA study shows only 5% of AI pilots transition to production with measurable value, largely because they sit on top of disconnected...

By The New Stack
Solving the Problems That Accompany API Sprawl With AI
NewsJan 15, 2026

Solving the Problems That Accompany API Sprawl With AI

Enterprises facing API sprawl risk security breaches and missed revenue opportunities. IBM’s Neeraj Nargund highlights that AI‑infused “smart APIs” can provide automated discovery, observability, and context‑aware governance across thousands of endpoints. The latest IBM API Connect 12.1 embeds AI throughout...

By The New Stack
ScyllaDB’s New Cloud Challenges DynamoDB Cost, Performance
NewsJan 15, 2026

ScyllaDB’s New Cloud Challenges DynamoDB Cost, Performance

ScyllaDB has launched X Cloud, a managed NoSQL service that claims to match or exceed DynamoDB’s performance at roughly half the cost. The platform can scale from 100 k to 2 million operations per second while keeping P99 latency in the single‑digit...

By The New Stack
Stop Wasting AI Investment on a Broken Change Approval Process
NewsJan 15, 2026

Stop Wasting AI Investment on a Broken Change Approval Process

The article argues that AI investments in developer productivity are wasted when change approval processes remain heavyweight and batch sizes stay large. It promotes working in small batches and lightweight, automated approvals as the antidote to slow pipelines and compliance...

By The New Stack
How To Choose the Right Tool for Your Google ADK Agent
NewsJan 14, 2026

How To Choose the Right Tool for Your Google ADK Agent

Google’s Agent Development Kit (ADK) introduces a structured tool framework that lets AI agents invoke external functions, turning them from pure text generators into autonomous actors. ADK categorizes tools into four types—function, built-in, third‑party, and Model Context Protocol (MCP) tools—each...

By The New Stack