How Jaeger Is Evolving to Trace AI Agents with OpenTelemetry

How Jaeger Is Evolving to Trace AI Agents with OpenTelemetry

CNCF Blog
CNCF BlogMay 26, 2026

Companies Mentioned

Why It Matters

By extending tracing to AI workloads, Jaeger gives enterprises faster root‑cause analysis of complex generative‑AI pipelines, reducing downtime and compliance risk.

Key Takeaways

  • Jaeger v2 now uses OpenTelemetry Collector as core ingestion engine
  • New protocols (MCP, ACP, AG-UI) enable AI‑agent assisted tracing
  • Unified binary runs same locally and in production for AI workloads
  • AI sidecars translate natural‑language queries into deterministic trace filters
  • Supports OpenTelemetry GenAI conventions to monitor RAG pipelines

Pulse Analysis

Observability tools have followed the evolution of software architecture. When microservices replaced monoliths, distributed tracing became a baseline capability, and Jaeger quickly became the de‑facto standard for visualizing request flows. Today, enterprises are embedding generative AI models, retrieval‑augmented generation pipelines, and autonomous agents directly into production services. These workloads introduce non‑linear execution paths—prompt assembly, vector‑store lookups, tool invocations—that stretch traditional span‑based tracing. Without a dedicated observability layer, engineers struggle to pinpoint latency spikes, token‑usage anomalies, or hallucination sources, prompting a new wave of tracing requirements.

Jaeger’s response arrives with the v2 release, which swaps its legacy collectors for the OpenTelemetry Collector framework. By ingesting OTLP natively, Jaeger consolidates metrics, logs, and traces into a single deployment, cutting translation latency and simplifying scaling. The project also layers three emerging standards—Model Context Protocol (MCP), Agent Client Protocol (ACP), and Agent‑User Interaction Protocol (AG‑UI)—to turn the tracing UI into an interactive workspace. ACP acts as a stateless translator, allowing natural‑language prompts to be converted into precise trace queries, while MCP secures model‑to‑data interactions. The updated UI, built on Zustand and React Query, streams trace context to an in‑app assistant that can summarize failure paths or highlight token consumption in real time.

From a business perspective, the unified Jaeger binary means teams can develop, test, and deploy AI‑aware tracing with identical configurations, reducing drift between staging and production. Organizations can start with on‑premise small language models for privacy‑sensitive debugging and later switch to cloud‑hosted LLMs for large‑scale incident analysis without rewiring the pipeline. By exposing standardized GenAI semantic conventions, Jaeger also future‑proofs observability investments as the OpenTelemetry community finalizes specifications for agentic systems. Early adopters stand to cut mean‑time‑to‑resolution on AI‑driven outages, improve compliance reporting, and gain actionable insight into token‑level performance.

How Jaeger is evolving to trace AI agents with OpenTelemetry

Comments

Want to join the conversation?

Loading comments...