The Company that Made RAG Mainstream Is Now Betting Against It
Pinecone, the creator of the vector‑database category that popularized retrieval‑augmented generation (RAG), announced Nexus, a knowledge engine that treats retrieval as a legacy pattern. The new platform pre‑compiles source data into typed, cited artifacts and introduces KnowQL, a declarative query language for agents. Pinecone claims Nexus lifts task‑completion rates above 90% while slashing token consumption by 90 percent. The move signals a shift from runtime retrieval to upstream knowledge compilation across the AI‑agent stack.
How NetEase Games Cut LLM Cold Starts From 42 Minutes to 30 Seconds
NetEase Games reduced large language model cold‑start latency from 42 minutes to under 30 seconds by adopting Fluid, a CNCF‑incubated data orchestration layer on Kubernetes. The shift replaced direct cross‑region storage access and a basic Alluxio cache with Fluid’s prefetching...
I Gave Our Developers an AI Coding Assistant. The Security Team Nearly Mutinied
A technology leader approved an AI coding assistant to relieve developers from repetitive tasks, but the security team reacted strongly, fearing uncontrolled code generation. The tool can draft tests, explain legacy code, and suggest refactors, yet it raises questions about...
Why the Linux Foundation Adopted MCP, with Jim Zemlin and Mazin Gilbert
The Linux Foundation has transferred ownership of the Model Context Protocol (MCP), Goose, and AGENTS.md to the newly created Agentic AI Foundation (AAIF). At the MCP Dev Summit in New York, CEO Jim Zemlin announced his step‑back from AAIF leadership,...

Greaves Cotton Appoints Vinay Pawar as Group Chief Technology Officer
Greaves Cotton, the Indian diversified engineering firm, has appointed Vinay Pawar as its Group Chief Technology Officer. Pawar arrives after a 30‑year career spanning ABB, Bosch, Continental, KPIT and most recently Minda Instruments, where he led R&D and the cockpit...
Extreme Moves Toward Autonomous Networking with Advanced AI Agent, Management Tools
Extreme Networks announced its second‑generation AI agent, Extreme Agent ONE, at Extreme Connect 2026, promising proactive, autonomous detection and remediation of network issues. The company also rolled out a major update to its Platform ONE management suite, adding third‑party device...

Apple Plans to Make iOS 27 a Choose Your Own Adventure of AI Models
Apple’s upcoming iOS 27 will let iPhone users pick from multiple on‑device AI models through a new “Extensions” framework. The feature, also slated for iPadOS 27 and macOS 27, will integrate third‑party large language models such as those from Google and Anthropic into...
AWS, IBM Boost Mainframe-Cloud Interoperability
AWS and IBM announced a joint effort to simplify hybrid‑cloud integration between AWS services and IBM Z mainframes. The partnership outlines five reference patterns, including real‑time data streaming enabled by IBM’s $11 billion acquisition of Confluent. AI‑driven tools such as AWS Transform...

Project ARIA Pushes AI From Concept to Soldier-Ready Capability
The U.S. Army’s Project ARIA, launched in March, is moving artificial intelligence from experimental labs to soldier‑ready tools. By partnering with industry and academia, the initiative targets high‑value use cases such as logistics, policy compliance, and daily workflows to cut administrative...
Broadcom Bets Big on VMware Cloud Foundation 9.1
Broadcom unveiled VMware Cloud Foundation 9.1, branding it as an AI‑ and Kubernetes‑native private cloud that supports AMD, Intel and Nvidia hardware. The release targets three pillars: mitigating hardware supply constraints, accelerating AI‑enabled application delivery, and enforcing zero‑trust security. New...
Global Telecom CTO 2026: AI, Open RAN, and Cloud Leadership Reshape Network Strategies
In 2026 telecom operators are transitioning from hardware‑centric networks to AI‑native, software‑defined ecosystems, elevating the CTO to a strategic innovation driver. Leaders at Vodafone, KT, Telefonica, Verizon, AT&T, Telkomsel, STC, Deutsche Telekom and Rakuten are championing Open RAN, sovereign cloud,...

Lili CTO Liran Zelkha on Building AI that Disappears
Fintech firms have spent the past two years layering chatbots and in‑app assistants onto their platforms, but Lili CTO Liran Zelkha argues the effort is misdirected. He believes small‑business owners, who are time‑pressed and uninterested in new interfaces, need AI...
Behind the AI in the Newsroom: The Washington Post’s Vineet Khosla
Washington Post CTO Vineet Khosla explains the paper’s "AI everywhere" strategy, which embeds artificial intelligence across news production and consumer products. The newsroom has rolled out AI‑generated personalized podcasts, now exceeding 100,000 episodes, and internal tools like Haystacker that let...
General Mills Adds Transformation to Tech Chief’s Remit
General Mills has expanded chief digital and technology officer Jaime Montemayor’s role to include transformation, making him chief digital, technology and transformation officer. The change supports a three‑year transformation effort that aims to deliver $600 million in FY2026 savings through AI‑driven...

Interview: Vishal Sharma, CTO of SearchUnify
SearchUnify’s CTO Vishal Sharma outlines how its Agentic AI Suite transforms enterprise support from simple document retrieval to autonomous, end‑to‑end case handling. The platform uses purpose‑built agents, federated retrieval‑augmented generation (FRAG), and real‑time orchestration to triage, resolve, and assist across...

NetApp Names New CTO to Lead EMEA, LATAM Technical Strategy
NetApp has named Jurgen Hofkens, former AWS AI infrastructure leader, as chief technology officer and vice president of sales engineering for its EMEA and LATAM markets. Hofkens will steer technical strategy, customer engagement, and AI‑driven data solutions across on‑premises, edge,...
GoodData.ai CTO Says the Enterprise AI Bubble Is Real, but so Is AI’s Transformational Power
GoodData CTO Peter Fedoročko acknowledges an AI bubble but argues the technology’s value will endure, likening AI to the internet’s evolution into essential infrastructure. He places the market between the Gartner peak of inflated expectations and the trough of disillusionment,...
Diskless Databases: What Happens when Storage Isn’t the Bottleneck
Diskless databases remove local persistence from the critical path, pairing in‑memory indexing with durable object storage. By separating compute from storage, they deliver millisecond‑level latency for ingest and query, even at petabyte scales. The architecture eliminates traditional replication complexity and...

AI Didn’t Reduce the Work, It Forced Us to Redesign It
An e‑commerce firm added an AI agent to its customer‑experience workflow expecting to cut workload. Ticket volume fell roughly 50% and response times improved, but the work did not disappear—it shifted toward system design, escalation handling, and managing uncertainty. The...
SAP to Acquire Data Lakehouse Vendor Dremio
SAP announced it will acquire data‑lakehouse vendor Dremio for an undisclosed price, aiming to embed an Apache Iceberg‑native lakehouse into its Business Data Cloud. Dremio’s technology lets enterprise data stay in‑place, providing federated access and AI‑ready semantics without costly data...
Altruist CTO Departs After Just One Year -- but What a Year; Jason Wenk Cites Strong Bench and Says Search...
Charles Schwab reported Q1 2026 earnings that beat EPS expectations but missed revenue forecasts, prompting a 7.6% share decline. The firm announced a $65 million investment to acquire Wealth.com, aiming to embed AI across estate, tax and portfolio services. Schwab says...
Give AI Agents Safe Access to Your Cluster: Model Context Protocol Server for Red Hat OpenShift Is Now in Technology...
Red Hat has launched a technology‑preview Model Context Protocol (MCP) server for OpenShift, enabling large‑language‑model agents to interact with clusters under strict security controls. The server adds OAuth/OIDC token‑exchange, native RBAC enforcement, read‑only‑by‑default mode, and detailed audit trails to keep AI...

Three Insights You Might Have Missed From theCUBE’s Coverage of Google Cloud Next
Google Cloud Next 2026 highlighted the company’s push to dominate the agentic AI control plane, the data‑centric operating layer that routes information across enterprise systems. Analysts emphasized that contextual data, graph traversal and vector embeddings are now core to AI‑ready...
Inside AMEX’s Agentic Commerce Stack: How Intent Contracts and Single-Use Tokens Enforce AI Transactions
American Express unveiled its Agentic Commerce Experiences (ACE) developer kit, a closed‑loop system that lets AI agents shop and pay on behalf of users within Amex’s own network. The kit introduces intent contracts, proof‑of‑intent tokens and single‑use payment tokens that...
SAP Buys Dremio, Prior Labs for AI Data Push
SAP announced two strategic acquisitions to strengthen its enterprise‑AI data infrastructure. It will buy Dremio, a data‑lakehouse platform, to augment the SAP Business Data Cloud and HANA Cloud with real‑time, non‑SAP data processing. SAP also secured Prior Labs, a startup...
The Orchestration Layer in Enterprise AI Just Got Named. It Has a Gemini Logo on It.
Google Cloud Next 2026 unveiled the Gemini Enterprise Agent Platform, turning Vertex AI into a full‑stack control plane for enterprise agents. The rollout featured a $750 million partner fund, a $240 billion Marketplace backlog, and announced integrations with Salesforce, SAP, ServiceNow, Oracle,...
How OpenAI Delivers Low-Latency Voice AI at Scale
OpenAI re‑engineered its real‑time voice AI infrastructure by separating the WebRTC stack into a lightweight relay and a stateful transceiver. The relay uses the ICE username fragment to route media to the owning transceiver while keeping a minimal public UDP...
The RAG Era Is Ending for Agentic AI — a New Compilation-Stage Knowledge Layer Is What Comes Next
Pinecone announced Nexus, a compilation‑stage knowledge engine designed for agentic AI, moving reasoning from inference time to a pre‑processing layer. The platform adds a context compiler, a composable retriever, and a new declarative query language called KnowQL. In Pinecone’s internal...

Intel Taps Top Qualcomm Exec To Lead Client Computing, Physical AI Group
Intel announced the appointment of Alex Katouzian, a former Qualcomm executive who led the Snapdragon X Series PC effort, as executive vice president and general manager of its Client Computing Group. Katouzian will report directly to CEO Lip‑Bu Tan and is tasked with aligning Intel’s...
SAP’s New API Policy Restricts AI Access, Draws Customer Criticism
SAP has rolled out a new API policy that restricts access to only those interfaces listed in the SAP Business Accelerator Hub or product documentation, labeling all others as unpublished. The policy expressly forbids using APIs for generative AI, large‑scale...
Azure IaaS: Defense in Depth Built on Secure-by-Design Principles
Microsoft’s Azure IaaS blog outlines a defense‑in‑depth model built on three Secure Future Initiative principles—secure by design, secure by default, and secure in operation. It details how hardware roots of trust, measured boot, and Trusted Launch protect the host and...
How OpenAI Scaled to 900 Million Weekly Users with Ory
OpenAI partnered with open‑source identity platform Ory to power its IAM layer as the company surged to 900 million weekly active users. The Ory integration replaced a legacy login system with zero downtime, delivering edge‑based token validation and full observability of...

Cisco To Acquire Astrix To Boost Identity Security For AI Agents
Cisco Systems announced it will acquire identity‑protection startup Astrix Security, a move aimed at strengthening its portfolio for securing AI agents and non‑human identities. While the exact price was not disclosed, industry sources estimate the deal at roughly $400 million, higher...

Inside Amazon Web Services' Plan to Make Networking Disappear
Amazon Web Services is engineering a self‑contained networking stack that makes the network virtually invisible to customers. By consolidating all switching functions onto a single ASIC and running a custom Linux‑based NetOS, AWS can scale its infrastructure while keeping hardware...
Arize AI and Google Cloud Lay Down Standardized Telemetry Mandate to Keep Enterprise Agents in Check
Arize AI and Google Cloud are joining forces to embed OpenTelemetry and OpenInference standards into Google’s Gemini Enterprise Agent Platform. The partnership lets developers instrument AI agents once and ship consistent traces to any observability backend, regardless of the underlying...
Palo Alto Networks Makes a $700M-Class AI Bet on Portkey Gateway
Palo Alto Networks announced its intent to acquire AI‑gateway startup Portkey, a deal valued in the $700 million range. Portkey already routes trillions of tokens each month for Fortune 500 firms and supports 3,000 LLMs, MCP servers, and agents via a single...
StarlingX 12.0 Is Right on Time for Mixed-Hardware Edge Deployments
OpenInfra Foundation released StarlingX 12.0, the first major 2026 update of its open‑source distributed cloud platform used by telecom operators such as Verizon and Vodafone. The release introduces Precision Time Protocol Partial Timing Support, enabling sub‑microsecond synchronization across mixed‑hardware edge...
Cisco Nerds Out: May the Fourth Be with Your AI Assistant
Cisco unveiled "Galaxy Mode" for its AI Assistant, a limited‑time Star Wars‑themed interface for Meraki and Thousand Eyes customers that runs through June 4. The release introduces Deep Reasoning, an AI‑driven analysis engine that interprets network events and offers security compliance...
From Code to Direction: Deriv’s VP of Engineering on Rebuilding the Software Development Pipeline Around AI
Deriv is rebuilding its software development pipeline around AI, moving engineers from hands‑on coders to directors who set intent and standards. The company embeds unified steering documents and quality gates so AI can generate, test, and document code consistently. An...
Shake Shack’s Tech Chief Forges a Practical AI Strategy
Shake Shack is rolling out Project Catalyst, an AI‑driven overhaul that will reach 1,500 restaurants, while simultaneously building an internal AI strategy for its 13,400‑member workforce. Chief information and technology officer Justin Mennen emphasizes practical applications that free staff time...
Small Language Models: Rethinking Enterprise AI Architecture
Enterprises are reshaping AI stacks by routing routine queries to 1‑7 billion‑parameter small language models (SLMs) while reserving trillion‑parameter large language models (LLMs) for complex reasoning. This division of labor can slash cloud inference costs by up to 90 % and deliver...
Agentic Browsers Rewrite the Rules of Enterprise Security
Enterprise browsers are evolving from passive tools to autonomous agents, driven by rapid AI adoption. Deloitte reports 74% of organizations will deploy agentic AI within two years, while 84% of knowledge workers are eager to use it. These agentic browsers...

Amgen Sees C-Suite Shifts
Amgen Inc. announced a cascade of C‑suite reshuffles after Chief Technology Officer David Reese announced his retirement effective June 1. Senior Vice President of AI and Data Sean Bruich will step into the CTO role, while EVP of R&D James Bradner will take...

How a Cloud-Native Architecture Handles Persistent Storage
Enterprises are rapidly embracing cloud‑native architectures, with 82% now running Kubernetes in production, up from 66% a year ago. While containers were originally designed as stateless workloads, modern business applications demand persistent storage, prompting a shift toward stateful solutions. The...

Quantum Readiness for Energy Sector: Not Encryption, Operational Longevity
The article argues that quantum‑readiness for energy firms must be framed around decades‑long asset lifecycles, not short‑term encryption upgrades. With three post‑quantum cryptography standards already ratified, the migration path can span 10‑20 years, matching the operational lifespan of power‑generation and...

The Settlement Layer: How X402 Completes the Agentic Commerce Stack
Coinbase, Cloudflare and Stripe have launched x402, a payment protocol that embeds a settlement handshake directly in HTTP 402 responses, enabling AI agents to complete purchases without human approval. Since its April 2026 debut, the protocol has processed over 165 million...

Uber Wants to Turn Its Millions of Drivers Into a Sensor Grid for Self-Driving Companies
Uber is planning to turn its global fleet of human drivers into a massive sensor network that streams real‑world data to autonomous‑vehicle (AV) developers. CTO Praveen Neppalli Naga said the initiative builds on the AV Labs program, which currently uses...

Code Orange: Fail Small Is Complete. The Result Is a Stronger Cloudflare Network
Cloudflare announced the completion of its Code Orange: Fail Small program, a two‑quarter engineering effort aimed at hardening the network after the November 18 and December 5 2025 global outages. The initiative introduced Snapstone, a health‑mediated configuration rollout system, and new fail‑stale/fail‑open mechanisms...
Fresh Data Has Us Asking, Does AI Demand Kubernetes?
Recent CNCF and SlashData research shows Kubernetes has become the de facto operating system for AI workloads. Two‑thirds of organizations running generative‑AI models use Kubernetes for inference, and overall production adoption of the orchestrator reaches 82 percent. The reports also highlight...
How SUSE Positions Itself as the Infrastructure Layer for the AI Era
SUSE is repositioning from a pure Linux vendor to an AI‑native infrastructure platform, integrating containers, virtual machines and AI services under its Rancher Prime suite. The company unveiled an open AI‑agent ecosystem and a context‑aware assistant named Liz that can...