MLOps Community

MLOps Community

Publication
0 followers

Independent community publication on MLOps practices, tooling, and production ML.

Choosing the Right Model Is Hard. Maintaining Accuracy Is Harder.
VideoApr 1, 2026

Choosing the Right Model Is Hard. Maintaining Accuracy Is Harder.

Ash Lewis, founder and CEO of Fast Labs, opened the session by highlighting a growing pain point for AI product teams: picking the right large‑language model (LLM) and keeping its performance steady once it’s in production. He noted that the...

By MLOps Community
Stop Shipping on Vibes — How to Build Real Evals for Coding Agents
VideoMar 31, 2026

Stop Shipping on Vibes — How to Build Real Evals for Coding Agents

At the Coding Agents Conference, Braintrust’s developer advocate Jessica Wang warned that many AI coding teams are “shipping on vibes,” deploying agents without solid evaluation frameworks. She emphasized that without real eval datasets, scoring systems, and controlled experiments, organizations are...

By MLOps Community
Decomposing the Agent Orchestration System: Lessons Learned
VideoMar 31, 2026

Decomposing the Agent Orchestration System: Lessons Learned

At the Coding Agents Conference, Union.ai’s chief ML engineer Niels Bantilan warned that building agents is less about novel features and more about resilient infrastructure. He emphasized that durable, self‑healing, and easily debuggable systems prevent costly downtime. Bantilan highlighted Flyte’s...

By MLOps Community
How to Make a Coding Agent a General Purpose Agent - Harrison Chase
VideoMar 31, 2026

How to Make a Coding Agent a General Purpose Agent - Harrison Chase

At the Coding Agents Conference on March 3, 2026, LangChain CEO Harrison Chase and Arcade AI CTO Sam Partee delivered a keynote arguing that the real barrier to scaling AI agents is not model intelligence but foundational infrastructure. They highlighted...

By MLOps Community
How AI Agents Store Memmories
VideoMar 26, 2026

How AI Agents Store Memmories

The video explores how artificial‑intelligence agents manage memory, contrasting traditional file‑system storage with newer, more dynamic approaches. It highlights the distinction between personalization memory and task‑execution memory, and why the choice of storage architecture matters for different agent designs. For agents...

By MLOps Community
Your Code Remembers Where It Broke
VideoMar 25, 2026

Your Code Remembers Where It Broke

The video introduces Temporal’s ability to remember exactly where a piece of code failed and resume execution once the error is fixed. This feature eliminates the traditional need to restart a server or rewrite logic after a syntax or runtime...

By MLOps Community
Lessons From 25 Trillion Tokens — Scaling AI-Assisted Development at Kilo
VideoMar 24, 2026

Lessons From 25 Trillion Tokens — Scaling AI-Assisted Development at Kilo

Kilo’s co‑founder and CEO Scott outlined how the company processed more than 25 trillion tokens since its May launch and used that data to reshape software engineering. By treating 2027‑level AI tools as core collaborators, Kilo shifted developers from manual coders...

By MLOps Community
Performance Optimization and Software/Hardware Co-Design Across PyTorch, CUDA, and NVIDIA GPUs
VideoMar 23, 2026

Performance Optimization and Software/Hardware Co-Design Across PyTorch, CUDA, and NVIDIA GPUs

The conversation centers on performance optimization and software‑hardware co‑design spanning PyTorch, CUDA, and NVIDIA GPUs, highlighted by the launch of SageMaker HyperPod—a service that keeps GPUs pre‑warmed for instant swapping. The speaker also promotes his new O'Reilly book that stitches...

By MLOps Community
Explaining Durable Execution
VideoMar 23, 2026

Explaining Durable Execution

The video explains Temporal’s durable execution model, emphasizing that workflow code must be deterministic. By restricting programs to repeatable logic—no random number generators or external nondeterministic calls—Temporal ensures that rerunning a workflow with identical inputs yields the same results. Key insights...

By MLOps Community
The Promise of Serverless
VideoMar 21, 2026

The Promise of Serverless

The video revisits the original promise of serverless computing, explaining how the term emerged organically as developers imagined writing code, uploading it to a massive cloud, and letting the platform handle execution without manual server management. It highlights key attributes such...

By MLOps Community
The Hard Truth About Building AI Agents
VideoMar 19, 2026

The Hard Truth About Building AI Agents

The speaker warns that building agentic AI requires careful, selective instruction design because developers cannot embed every rule or constraint into prompts. Even with very large context windows, empirical limits mean only a fraction of tokens should be used effectively,...

By MLOps Community
“I’ll Burn Out 2 in Minutes”  The Brutal Reality of GPU Clusters
VideoMar 19, 2026

“I’ll Burn Out 2 in Minutes” The Brutal Reality of GPU Clusters

The speaker describes stress-testing new GPU clusters by immediately pushing them to maximum load and routinely causing about 2% of units to fail within minutes because many accelerators are engineered to run extremely hot and rely on substantial cooling. He...

By MLOps Community
Write Reliable Software with Temporal
VideoMar 17, 2026

Write Reliable Software with Temporal

The video introduces Temporal’s durable execution model as a way to boost developer productivity when building agentic systems. It explains how Temporal abstracts reliability concerns, allowing developers to write ordinary code that runs to completion despite cloud‑scale failures, flaky services,...

By MLOps Community
MLOps Coding Skills: Bridging the Gap Between Specs and Agents
NewsMar 3, 2026

MLOps Coding Skills: Bridging the Gap Between Specs and Agents

The article introduces Agent Skills, a lightweight markdown‑based tool that injects organization‑specific engineering standards into AI coding agents. By converting sections of the MLOps Coding Course into SKILL.md files, the author shows how agents can automatically apply preferred tools such...

By MLOps Community
Using Agents in Production: Past Present and Future // Euro Beinat
VideoMar 1, 2026

Using Agents in Production: Past Present and Future // Euro Beinat

Prosus announced it has shipped nearly 8,000 AI agents, with only 15% achieving production status while the remainder function as learning experiments. The data was presented at the Computer History Museum’s Coding Agents virtual conference on March 3, where industry...

By MLOps Community
Context Engineering 2.0: MCP, Agentic RAG & Memory // Simba Khadder
VideoFeb 28, 2026

Context Engineering 2.0: MCP, Agentic RAG & Memory // Simba Khadder

Simba Khadder unveiled Redis Context Engine at the Coding Agents Conference, positioning it as "Context Engineering 2.0" that merges retrieval, tool invocation, and memory into a single MCP‑native surface. The platform treats documents, databases, events, and live APIs as addressable resources via...

By MLOps Community
Enterprise-Ready MCP // Jiquan Ngiam
VideoFeb 28, 2026

Enterprise-Ready MCP // Jiquan Ngiam

The Coding Agents Conference on March 3 will feature Jiquan Ngiam discussing the rapid enterprise adoption of agents and Model Context Protocols (MCPs). Over 80 % of professional developers now use AI tools daily, and agentic coding platforms such as Claude Code...

By MLOps Community
MLflow Leading Open Source
VideoFeb 24, 2026

MLflow Leading Open Source

Databricks’ leaders Corey Zumar, Jules Damji, and Danny Chiao discussed the latest evolution of MLflow on the MLOps Podcast. The open‑source platform is being rebuilt to handle generative AI, agent workloads, and production‑grade governance, moving beyond its original data‑science‑only focus....

By MLOps Community
Simulate to Scale: How Realistic Simulations Power Reliable Agents in Production // Sachi Shah
VideoFeb 24, 2026

Simulate to Scale: How Realistic Simulations Power Reliable Agents in Production // Sachi Shah

At the Computer History Museum’s Coding Agents Conference, Sachi Shah presented how realistic, scalable simulations are essential for deploying reliable AI agents in production. She explained that simulations can mirror messy real‑world interactions—including multilingual dialogue, emotional states, background noise, and...

By MLOps Community
Yes, We Do Need MCP
VideoFeb 24, 2026

Yes, We Do Need MCP

The upcoming Coding Agents Conference will feature a deep‑dive into MCP, a stateful communication protocol designed for AI agents. Organizers argue that MCP’s built‑in statefulness differentiates it from gRPC and HTTP, enabling conversations to resume after interruptions. The talk will...

By MLOps Community
Building an Orchestration Layer for Agentic Commerce at Loblaws
VideoFeb 23, 2026

Building an Orchestration Layer for Agentic Commerce at Loblaws

The talk introduced Alfred, Loblaws’ production‑grade orchestration layer designed to power agentic commerce across its massive retail ecosystem. Built on Google Kubernetes Engine with a FastAPI gateway, Alfred abstracts LLM providers, leverages LangChain‑style execution graphs, and connects to over fifty...

By MLOps Community
Agents as Search Engineers // Santoshkalyan Rayadhurgam
VideoFeb 23, 2026

Agents as Search Engineers // Santoshkalyan Rayadhurgam

Santoshkalyan Rayadhurgam argues that the foundational assumption of classic retrieval—users supply fully formed intent—is collapsing, prompting a transition from deterministic, stateless pipelines to agentic, stateful search systems that reason across turns. He contrasts three generations: lexical BM25 pipelines, vector‑based RAG models,...

By MLOps Community
How AI Covered a Human’s Paternity Leave // Quinten Rosseel
VideoFeb 22, 2026

How AI Covered a Human’s Paternity Leave // Quinten Rosseel

During a head of data’s paternity leave, a logistics SaaS firm relied on an AI analyst named “Wobby” to handle incoming data questions. The agent answered roughly 60 % of queries, demonstrating that a well‑engineered AI can fill staffing gaps without...

By MLOps Community
MCP Security: The Exploit Playbook (And How to Stop Them)
VideoFeb 21, 2026

MCP Security: The Exploit Playbook (And How to Stop Them)

The video spotlights the rapid rise of the MCP (Model‑Centered Programming) standard since its November 2024 launch and the stark security lag that now threatens its expanding ecosystem. While major platforms are racing to support MCP, developers are left scrambling to...

By MLOps Community
The Future of Coding: AI Agents & the Next Tech Revolution // Ricky Doar
VideoFeb 21, 2026

The Future of Coding: AI Agents & the Next Tech Revolution // Ricky Doar

The conversation centers on Cursor, an AI‑driven coding assistant, and how developers are adapting to a new paradigm where large language models act as pair programmers or autonomous agents. Ricky Doar and his guest discuss the rapid adoption of Cursor...

By MLOps Community
Fast & Asynchronous: Drift Your AI, Not Your GPU Bill // Artem Yushkovskiy
VideoFeb 19, 2026

Fast & Asynchronous: Drift Your AI, Not Your GPU Bill // Artem Yushkovskiy

The talk introduced ASEA, an open‑source asynchronous‑actor framework designed to replace traditional batch pipelines for generative AI workloads. By decoupling each processing step into self‑hosted GPU actors that communicate via message queues, the team at a global food‑delivery platform eliminated...

By MLOps Community
Beyond the Gold Standard: Evaluating and Trusting Agents in the Wild // Sanjana Sharma
VideoFeb 19, 2026

Beyond the Gold Standard: Evaluating and Trusting Agents in the Wild // Sanjana Sharma

AI agents look impressive in demos, but production reliability hinges on context, evaluation, and trust. Sanjana Sharma argues enterprises must shift from model‑first to system‑first thinking, embedding explicit business rules, subject‑matter‑expert (SME) heuristics, and versioned context layers. The talk outlines three...

By MLOps Community
Speed and Scale: How Today's AI Datacenters Are Operating Through Hypergrowth
VideoFeb 10, 2026

Speed and Scale: How Today's AI Datacenters Are Operating Through Hypergrowth

The video, hosted by Chris, co‑founder and CEO of Netbox Labs, examines the unprecedented speed and scale of today’s AI datacenter construction. He frames Netbox as the de‑facto system‑of‑record that tracks everything from power and cooling to rack‑level configurations, giving...

By MLOps Community