
What's Missing Between LLMs and AGI - Vishal Misra & Martin Casado
In this episode, Martin Casado interviews Vishal Misra, a Columbia professor who has built a mathematical model of how large language models (LLMs) operate, treating them as massive sparse matrices that update token probabilities via Bayesian-like inference. Misra explains his early work using GPT‑3 for in‑context learning to translate natural language queries into a custom domain‑specific language for cricket statistics, and how this led to a series of papers showing that LLMs perform predictable, matrix‑based updates rather than true reasoning. He argues that while LLMs excel at pattern matching, achieving AGI will require continual post‑training learning and a shift from correlation to causal modeling. The conversation also touches on the debate over whether LLM behavior is genuinely Bayesian and the practical tools Misra created to probe token probabilities.
380: Customer Service's AI Shift: Zendesk CTO Adrian McDermott on Deterministic AI and Context Engineering
In this episode, Zendesk CTO Adrian McDermott discusses the company’s evolution from a product‑led startup to a AI‑driven leader in customer service, emphasizing the shift toward deterministic AI and context engineering. He explains how Zendesk uses generative AI tools to boost...

Open Source for Awkward Robots
In this episode, host Ryan Donovan chats with Jan Lipart, CEO and co‑founder of OpenMind, about their open‑source robotics platform OM1 that lets humanoid robots communicate internally via natural language and be governed by immutable, blockchain‑stored rules like Asimov's laws....

What It Takes to Clear a Million Crimes a Year with Flock Safety's CEO
In this episode, Garrett Langley, CEO of Flock Safety, explains how his company transformed neighborhood security by deploying license‑plate‑reading cameras, AI‑driven analytics, and drones that integrate with 911 calls to create a real‑time crime‑clearance operating system. He recounts the origin...

Even the Chip Makers Are Making LLMs
In this episode, NVIDIA VP of Generative AI Keri Britsky explains why a GPU chip maker is now deeply involved in building large language models (LLMs). She describes NVIDIA’s extreme hardware‑software co‑design process, where model development informs GPU architecture, precision...

AI-Assisted Coding Needs More than Vibes; It Needs Containers and Sandboxes
In this episode, Docker President Mark Cavett discusses how containers are becoming essential for safely running AI‑generated code, emphasizing the need for hardened images to bridge the trust gap. He explains Docker’s new open‑source Docker Hardened Images (DHI) catalog, which...

No Need for Ctrl+C when You Have MCP
In this episode, Ryan Donovan interviews David Soria Parra, co‑creator of the Model Context Protocol (MCP) and a technical staff member at Anthropic. They discuss the origin of MCP as a solution to the copy‑paste friction when using LLMs, its evolution...

Why Stack Overflow and Cloudflare Launched a Pay-per-Crawl Model
In this episode, Stack Overflow’s Janice Manningham and Josh Zhang chat with Cloudflare VP Will Allen about the newly launched pay‑per‑crawl model that lets publishers charge crawlers for access. They explain how AI‑driven content scraping has upended the traditional open‑versus‑block...

Data Is the New Oil, and Your Database Is the only Way to Extract It
In this episode, Ryan interviews Shireesh Thota, Corporate Vice President of Azure Databases at Microsoft, about the rapid evolution of Microsoft's database offerings, including SQL Server, Cosmos DB, and Postgres, and how they fit into a unified Azure data platform....

Even Your Voice Is a Data Problem
In this episode, Ryan interviews Scott Stephenson, CEO and co‑founder of Deepgram, about the latest advances in voice AI, focusing on how deep learning improves speech‑to‑text and text‑to‑speech accuracy across diverse dialects and noisy environments. They discuss Deepgram’s scalable, affordable...