
The AI Hardware Show episode dives deep into the rapidly evolving LLM inference market, profiling a suite of startups that are redefining data‑center acceleration. Hosts Sally Ward Foxton and Ian Cutras outline why inference at scale is the next cash‑flow engine, noting that dozens of unicorns are racing to lock down deterministic performance, power efficiency, and cost advantages. Key insights include Groq’s Language Processing Unit, a 14 nm chip that eliminates caches, DRAM and out‑of‑order execution to guarantee compile‑time latency, and its upcoming 4 nm, stacked‑DRAM successor funded by a $700 million Series D. Etched’s SOHU ASIC, built on TSMC’s 4 nm node, forgoes all flexibility to run transformers exclusively, claiming 500 k Llama 70B tokens per second—an order of magnitude ahead of Nvidia’s Blackwell. Meanwhile, New chips’ Raptor accelerator balances modest 8‑10 tps per chip latency with on‑device vector search, targeting enterprise workloads where power and latency trump raw throughput. Samanova’s SN40L leverages a coarse‑grained reconfigurable array, 520 MB SRAM and 64 GB HPM to serve multi‑trillion‑parameter models with micro‑second model‑switching, sold as a fully integrated rack. Talis bets on a “hard‑core model‑as‑silicon” approach, recompiling each model onto a custom chip for thousand‑fold efficiency gains, while Posetron’s FPGA‑based Atlas card promises 70 % faster token rates than Nvidia Hopper by exploiting HBM‑enabled Altera Agile FPGAs. Notable quotes underscore the stakes: Groq’s acquisition by Nvidia was announced on Christmas Eve 2025, Etched’s CEO admits, “If transformers lose, we lose,” and Talis’s founder emphasizes eliminating every runtime abstraction. Posetron’s founders, former Groq engineers, tout 93 % memory‑bandwidth utilization on DDR‑only ASICs as a path to competitive performance without HBM. These anecdotes illustrate the spectrum from ultra‑flexible CPUs to single‑purpose ASICs, each carving a niche in the inference hierarchy. The implications are clear: investors must choose between flexibility and peak efficiency, while hyperscalers weigh deterministic latency against the risk of architectural lock‑in. As power‑hungry GPUs approach diminishing returns, specialized silicon—whether deterministic LPUs, transformer‑only ASICs, or model‑compiled chips—could reshape AI infrastructure economics, driving down cost per token and enabling new edge‑centric generative applications.
TechInsights hosted a data‑driven seminar for AWS automotive teams, highlighting a decisive shift from distributed electronic modules to centralized compute architectures in vehicles. The briefing detailed how high‑performance automotive SoCs, expanding memory bandwidth, and silicon‑carbide power devices are reshaping AI...
YieldWerx announced a forthcoming webinar that will teach semiconductor engineers how to implement co‑packaged photonics (CPO) across the full product lifecycle. The session, led by CEO/CTO Aftkhar Aslam, will detail the 12 cross‑domain challenges—from optical data complexity to test‑flow discontinuities—and...
CEA‑Leti, CEA‑List and Powerchip Semiconductor Manufacturing Corporation (PSMC) announced a partnership to embed CEA‑List’s RISC‑V processor IP and CEA‑Leti’s microLED silicon‑photonic chiplets into PSMC’s 3D‑stacking and interposer platform. The integration replaces traditional copper interconnects with short‑reach, high‑bandwidth optical links, delivering...
Pat’s Law of Something: All major vendors and suppliers are always in talks and testing. I do think this will move forward, though. Packaging is an easier commitment than wafers as it doesn’t force a redesign. Not easy but just...
Working on building a new cost per transistor model with updated economics since it's been a few years since I have seen one. So many more dynamics are needed than the ones from a few years ago, pre AI...
RISC‑V has moved from a promising ISA to a viable platform as the RVA23 baseline unifies high‑performance compute. Arm’s recent transition to a silicon‑first model reshapes the IP landscape, intensifying competition. Akeana’s Alpine test chip, taped out in a 4 nm...
Indium Corporation has signed a long‑term offtake framework with Flash Metals USA, a Metallium subsidiary, to purchase critical metals recovered from electronic scrap using Flash Joule Heating technology. The agreement covers gallium, germanium, copper, tin, gold and indium, with an...
Japan’s state‑backed chipmaker Rapidus has moved its IIM‑1 plant in Hokkaido from construction to an operating pilot line, delivering working two‑nanometer gate‑all‑around prototypes. The company secured a ¥267.6 billion ($1.7 billion) financing round led by the government and over 30 private partners,...

Intel’s upcoming Nova Lake Core Ultra 400 desktop CPUs, now expected in 2027, are rumored to pack up to 52 cores, higher clock speeds and a notable IPC uplift that could outpace AMD’s Zen 6 chips. Leaker HXL suggests Nova Lake’s P‑core...

Chinese manufacturer WCH’s CH32V006 RISC‑V microcontroller, launched in 2024, powers the new $2 WeAct CH32V006F8U6 Mini Core development board. The board features a 48 MHz 32‑bit RISC‑V2C core, 8 KB SRAM, 62 KB flash, USB‑C power, and selectable 3.3 V or 5 V I/O via...
People are using AI to break CUDA’s moat. Given any pytorch model, it profiles it, ranks bottlenecks by amdahl's law, writes triton or CUDA C++ replacements, and runs 300+ experiments overnight with no human in the loop. - 5.29x over pytorch eager...
I remember so many people thinking I was nuts to suggest that years ago that XPUs would be successful and that it would be a GPU-only world.

The video chronicles the rise of Very Long Instruction Word (VLIW) architectures, a radical approach that promises computers up to twenty‑plus times faster without exotic silicon. By shifting the burden of parallelism from hardware to a sophisticated compiler, VLIW packs...
#Technology #Newsletter #Semiconductor #Manufacturing #Product NLOG-298 | Semiconductor And Beyond Newsletter | The Semiconductor Product Types: https://newsletter.chetanpatil.in/p/semiconductor-and-beyond-newsletter-298/

Intel announced the Core Ultra X9 378H, a 16‑core Panther Lake processor that mirrors the X7 368H’s architecture but caps its turbo at 5 GHz, slightly below the X9 388H’s 5.1 GHz. It retains the Intel Arc B390 integrated GPU while stripping enterprise‑grade features such as vPro, AMT, and...

Intel’s upcoming Nova Lake‑S desktop processor line has a leaked 44‑core SKU, replacing the previously announced 42‑core model by using two identical 8P+12E compute tiles. The change frees up 6P+12E tiles, which could be sold as lower‑priced, locked non‑K variants...