DevOps, open source, and cloud native news with resources and insights for developers
AI chip startup d-Matrix announced a $275 million funding round to advance its in‑memory compute platform for AI inference. The capital will support scaling of its Jetstream accelerator cards and further development of its Digital In‑Memory Compute technology. The round underscores growing investor interest in specialized AI inference hardware.
Anthropic’s new Agent Skills introduce modular, declarative bundles of expertise that agents can load on demand, addressing prompt bloat and context‑window limits. By separating decision‑making logic (agents) from reusable procedural knowledge (skills), developers can extend capabilities without rewriting core agents....
The New Stack tutorial outlines a production‑ready stack for building agentic AI using Retrieval‑Augmented Generation (RAG) and FastAPI. It combines a LangChain‑style reasoning loop, FAISS vector search, schema‑based guardrails, and token‑metered cost controls. The guide adds async execution, retries, semantic...
Astro, now under Cloudflare, released the first beta of Astro 6, introducing a redesigned development server built on Vite’s Environment API. The server runs web applications in the same JavaScript engine used in production, delivering true runtime parity across Node and...
The article outlines how site reliability engineering (SRE) is evolving from reactive alerting toward AI‑driven preventative reliability. Early AI stages improved triage and auto‑remediation, but still required a failure to occur. By mining years of incident post‑mortems, logs, and topology...
Enterprises are racing to embed AI while still relying on legacy applications that run core operations. A recent MIT‑NANDA study shows only 5% of AI pilots transition to production with measurable value, largely because they sit on top of disconnected...
Enterprises facing API sprawl risk security breaches and missed revenue opportunities. IBM’s Neeraj Nargund highlights that AI‑infused “smart APIs” can provide automated discovery, observability, and context‑aware governance across thousands of endpoints. The latest IBM API Connect 12.1 embeds AI throughout...
ScyllaDB has launched X Cloud, a managed NoSQL service that claims to match or exceed DynamoDB’s performance at roughly half the cost. The platform can scale from 100,000 to 2 million operations per second while keeping P99 latency in the single‑digit...
The article argues that AI investments in developer productivity are wasted when change approval processes remain heavyweight and batch sizes stay large. It promotes working in small batches and lightweight, automated approvals as the antidote to slow pipelines and compliance...
Google’s Agent Development Kit (ADK) introduces a structured tool framework that lets AI agents invoke external functions, turning them from pure text generators into autonomous actors. ADK categorizes tools into four types—function, built-in, third‑party, and Model Context Protocol (MCP) tools—each...
CloudBees CEO Anuj Kapur warns that many enterprises are chasing costly migration projects, a pursuit he calls a “migration mirage.” A recent survey shows 57% of organizations spent over $1 million on migrations last year, with an average $315,000 loss per project due to...
Internal developer platforms often collapse under cloud sprawl, compliance pressure, and AI‑driven automation because they lack a first‑class boundary. The article argues that treating a tenant as an explicit logical construct—binding identity, networking, secrets, compute, observability, and compliance—solves these issues....
Enterprises are rapidly expanding API portfolios from dozens to thousands, exposing gaps in traditional, ad‑hoc development practices. The New Stack outlines four core principles—treat every artifact as code, maintain a single source of truth, automate all repeatable tasks, and delegate...
Ramp introduced Ramp Inspect, an AI‑powered coding assistant integrated into its spend‑management platform. The cloud‑hosted tool writes, tests, and visually verifies code, offering instant virtual‑machine environments without local setup. Ramp open‑sourced the Inspect blueprint, enabling other firms to build similar...
A data pipeline re‑processed Saturday’s transactions 47 times over a weekend after Airflow retries invoked a flawed fallback that loaded the previous successful date instead of the missing data. The bug let each retry succeed while still ingesting stale data,...
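The failure mode described above can be illustrated with a minimal Python sketch. It is not the article's code and does not use Airflow's API; the function names, the `MissingPartitionError` class, and the in-memory `store` dictionary are all hypothetical stand-ins. The first function reproduces the silent fallback to an earlier date, and the second shows one possible alternative: raising an error so that a retry cannot succeed on stale data.

```python
from datetime import date


class MissingPartitionError(Exception):
    """Hypothetical error raised when the requested date has no data."""


def flawed_load_partition(store, run_date):
    # Illustrates the bug: if the requested date is missing, silently
    # return the most recent earlier date, so the task "succeeds".
    if run_date in store:
        return store[run_date]
    earlier = [d for d in store if d < run_date]
    return store[max(earlier)]


def load_partition(store, run_date):
    # One possible fix: fail loudly so a retry never ingests stale data.
    if run_date not in store:
        raise MissingPartitionError(f"no data for {run_date.isoformat()}")
    return store[run_date]


# Saturday's data exists; Sunday's has not arrived yet.
saturday, sunday = date(2024, 1, 6), date(2024, 1, 7)
store = {saturday: ["sat-tx-1", "sat-tx-2"]}

# The flawed loader hands back Saturday's rows for a Sunday run.
stale_rows = flawed_load_partition(store, sunday)
```

With the strict loader, every retry for the missing date fails visibly until the data actually lands, which turns a silent duplication into an alert.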
The Business Source License (BSL) has emerged as a hybrid model that lets developers use software freely while restricting commercial re‑hosting, a shift sparked by HashiCorp’s move from MPL 2.0 to BSL for Terraform. High‑profile forks such as OpenTofu illustrate...
Enterprises moving LLM pilots to production face soaring infrastructure costs, unpredictable latency, and limited auditability. A growing alternative pairs small language models (SLMs) with retrieval‑augmented generation (RAG), delivering CPU‑friendly inference and data‑grounded outputs. The article outlines a modular, agent‑based architecture...
Postman announced the acquisition of Fern, an open‑source developer‑experience company that provides Fern Docs and an SDK generator supporting languages such as TypeScript, Python, Go, Java, Ruby, C# and PHP. The move expands Postman's API platform with customizable documentation and...
Google Cloud introduced GKE Sandbox for Agents, a Kubernetes‑native extension that runs untrusted or specialized workloads in lightweight, VM‑like sandboxes powered by gVisor. The solution adds a new Sandbox custom resource definition (CRD) and controller, offering stable identities, persistent storage,...
Container rightsizing adjusts Kubernetes CPU and memory requests to reflect real usage, eliminating the drift caused by static, early‑stage settings. Oversized requests waste capacity, inflate node counts, and can force larger instance types, while undersized settings risk performance issues. The...
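As a rough illustration of the idea, the Python sketch below derives a CPU request from observed usage samples. The approach (a nearest-rank percentile plus a headroom multiplier), the function name `recommend_request`, and the default values of 95 and 1.15 are assumptions chosen for the example, not figures or methods taken from the article.

```python
import math


def recommend_request(samples_millicores, percentile=95, headroom=1.15):
    """Suggest a CPU request (in millicores) from observed usage samples.

    Uses the nearest-rank percentile of the samples, then multiplies by a
    headroom factor and rounds up. All defaults are illustrative.
    """
    if not samples_millicores:
        raise ValueError("need at least one usage sample")
    ordered = sorted(samples_millicores)
    # Nearest-rank percentile: smallest value covering `percentile`% of samples.
    rank = max(1, math.ceil(percentile * len(ordered) / 100))
    return math.ceil(ordered[rank - 1] * headroom)


# Example: usage samples of 1..100 millicores.
suggested = recommend_request(list(range(1, 101)))
```

Sizing to a high percentile rather than the peak keeps a single spike from inflating the request, while the headroom factor leaves a margin against undersizing.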
Enterprise AI is moving from simple assistants that merely aid human tasks to autonomous agents that execute within guardrails, and finally to coordinated multi‑agent systems that own end‑to‑end outcomes. The article outlines a three‑phase roadmap over the next two years,...
Bryan Cantrill argues that the 2014 release of Kubernetes disrupted Amazon Web Services' near‑monopoly by giving enterprises a cloud‑agnostic orchestration layer. He notes AWS’s 2014 revenue of $4.6 billion and its aggressive price‑cut strategy, which left rivals scrambling. Kubernetes’ open‑source model...
The article outlines a transition from single‑user, prompt‑driven AI to multiplayer AI agents that embed themselves in team conversations and retain project context over time. It describes four core agent roles—context, coordination, synthesis, and execution—that work together to surface information,...
Ari Zilka, former Hortonworks CPO and founder of MyDecisive.ai, highlighted at KubeCon 2025 that 23 observability vendors are delivering nearly identical reactive dashboards that only surface problems after failures occur. He argued that CIOs consistently demand lower mean time to...
Engineering teams are moving from noisy, threshold‑based alerts to higher‑order signals that embed context, confidence, and actionable insight. Alerts, once sufficient for simple infrastructures, now generate fatigue as they treat every deviation equally. Signals combine correlation, relevance to business impact,...
The article argues that separating the control plane from the data plane is reshaping SaaS delivery. In a decoupled model, the vendor provides the control plane as a managed service while customers run the data plane in their own cloud...
Martin Fowler, chief scientist at Thoughtworks, frames large language models as a move to nondeterministic computing, comparable to the historic shift from assembly language to high‑level languages. He contrasts deterministic binary logic with the probabilistic nature of LLM outputs, emphasizing...
A critical outage at 3:17 a.m. exposed a single point of failure in a customer‑facing API, costing $14,000 in SLA credits and eroding trust. The incident led the author to create the “3 a.m. Test,” a design checklist that ensures on‑call engineers can quickly...