Skip to main content

Building Reliable AI Agents: ADK, Arize Phoenix, and Observability

Google for DevelopersNovember 25, 202510 min3,732 views
19 connections·26 entities in this video

The Challenge of AI Agent Reliability

  • 💡 AI agents are fundamentally different from traditional software due to their non-deterministic nature and complex emergent behaviors.
  • ⚠️ Debugging AI agents involves more than just code; it requires inspecting tools, prompts, and reasoning.
  • 🎯 Observability is crucial for understanding agent behavior, moving from "I hope this works" to "I know how this works."

Google ADK and Arize Phoenix Integration

  • 🛠️ The Google Agent Development Kit (ADK) is designed with built-in plugins and callbacks to inspect agent actions like LLM and tool calls.
  • 🚀 Arize Phoenix is an open-source tool that integrates with ADK to provide tracing, evaluation, and iteration capabilities for AI agents.
  • 🔗 Integration involves instrumenting the ADK agent with a few lines of code using OpenTelemetry standards via Open Inference.

Live Tracing and Debugging

  • 🔍 Live tracing in Phoenix allows developers to see the agent's actions in real-time as it processes queries.
  • 📊 By comparing traces side-by-side, developers can identify logic errors, such as when an agent fails to call a necessary tool.
  • ✅ This process, known as error analysis, helps pinpoint issues even when no explicit exceptions are thrown.

Scaling Debugging with Evaluations

  • 📈 For large-scale debugging, evaluations (evals) are essential to assess agent outputs.
  • 📦 Offline evals are used during experimentation, similar to unit testing, before deployment.
  • 🌐 Online evals are critical once the application is live, helping to evaluate outputs and identify areas for improvement.
  • 🎯 An example eval discussed is answer correctness, using an LLM as a judge to determine if the agent's response adequately addresses the user's question.

Benefits of Open Standards and Collaboration

  • 💬 Open standards, like OpenTelemetry, create a common language for tools to communicate, simplifying data integration.
  • 🤝 The collaboration between ADK and Arize Phoenix demonstrates how open-source frameworks enable seamless integrations for enhanced debugging and iteration.
  • ✨ This combination allows teams to move from a prototype to a reliable, production-ready AI application.
Knowledge graph26 entities · 19 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover · drag to explore
26 entities
Chapters5 moments

Key Moments

Transcript38 segments

Full Transcript

Topics14 themes

What’s Discussed

AI AgentsObservabilityArize AIArize PhoenixGoogle ADKAgent Development KitLLMOpenTelemetryTracingDebuggingEvaluationsProduction AINon-deterministic SystemsGenAI
Smart Objects26 · 19 links
Products· 8
Concepts· 13
People· 2
Companies· 2
Media· 1