Nvidia May Soon Unveil a Brand-New AI Chip. A Closer Look at the $20 Billion Bet to Make It Happen

•March 13, 2026

CNBC Technology•Mar 13, 2026

Why It Matters

The move strengthens Nvidia’s foothold in the fast‑growing inference segment, directly challenging rivals like AMD, Google’s TPUs, and Amazon’s Trainium, and could unlock new revenue streams beyond its traditional GPU business.

Key Takeaways

•Nvidia spent $20 B licensing Groq’s inference technology.
•New chip targets AI inference, not training workloads.
•Integration may mirror Nvidia’s successful Mellanox acquisition.
•Boosts Nvidia’s inference share amid rising competition.
•GTC will likely unveil details on Groq‑based accelerator.

Pulse Analysis

Nvidia’s $20 billion investment in Groq marks a decisive shift toward specialized inference hardware, a segment that now accounts for roughly 40% of its data‑center revenue. As AI models move from research labs to production environments, cost‑effective, low‑latency inference becomes a priority for enterprises. Competitors such as AMD, Google’s TPU line, and Amazon’s Trainium are already courting the same market, intensifying pressure on Nvidia to diversify beyond its training‑centric GPUs. By licensing Groq’s LPU architecture, Nvidia aims to deliver chips that prioritize speed and efficiency, leveraging on‑chip SRAM to sidestep the high‑cost, supply‑tight high‑bandwidth memory (HBM) used in traditional GPUs.

Technically, Groq’s LPUs differ from Nvidia’s GPUs in two key ways: memory hierarchy and execution model. The LPU’s on‑die SRAM enables ultra‑fast data access, reducing latency for real‑time inference tasks, while GPUs rely on parallel processing power suited for massive training workloads. Ross’s vision of a hybrid system—running portions of a model on LPUs and the remainder on GPUs—could create a “nitro‑boost” effect, improving throughput and lowering operational costs for existing GPU fleets. This approach also aligns with Nvidia’s broader strategy of building a heterogeneous AI stack, where CPUs, GPUs, networking, and now inference accelerators work in concert.

Strategically, the Groq deal echoes Nvidia’s 2019 acquisition of Mellanox, which transformed the company into a one‑stop AI infrastructure provider by adding high‑performance networking. If the Groq integration proves as lucrative, Nvidia could replicate that revenue boost, potentially adding double‑digit growth to its inference line and reinforcing its position as the default AI computing platform. Investors will be watching GTC closely; clear product roadmaps and early customer wins could translate into significant market share gains and justify the hefty price tag of the Groq partnership.