Skip to main content

Building Reliable AI Agents: ADK, Arize Phoenix, and Observability

Google for DevelopersNovember 25, 202510 min3,732 views
19 connections·26 entities in this video→

The Challenge of AI Agent Reliability

  • πŸ’‘ AI agents are fundamentally different from traditional software due to their non-deterministic nature and complex emergent behaviors.
  • ⚠️ Debugging AI agents involves more than just code; it requires inspecting tools, prompts, and reasoning.
  • 🎯 Observability is crucial for understanding agent behavior, moving from "I hope this works" to "I know how this works."

Google ADK and Arize Phoenix Integration

  • πŸ› οΈ The Google Agent Development Kit (ADK) is designed with built-in plugins and callbacks to inspect agent actions like LLM and tool calls.
  • πŸš€ Arize Phoenix is an open-source tool that integrates with ADK to provide tracing, evaluation, and iteration capabilities for AI agents.
  • πŸ”— Integration involves instrumenting the ADK agent with a few lines of code using OpenTelemetry standards via Open Inference.

Live Tracing and Debugging

  • πŸ” Live tracing in Phoenix allows developers to see the agent's actions in real-time as it processes queries.
  • πŸ“Š By comparing traces side-by-side, developers can identify logic errors, such as when an agent fails to call a necessary tool.
  • βœ… This process, known as error analysis, helps pinpoint issues even when no explicit exceptions are thrown.

Scaling Debugging with Evaluations

  • πŸ“ˆ For large-scale debugging, evaluations (evals) are essential to assess agent outputs.
  • πŸ“¦ Offline evals are used during experimentation, similar to unit testing, before deployment.
  • 🌐 Online evals are critical once the application is live, helping to evaluate outputs and identify areas for improvement.
  • 🎯 An example eval discussed is answer correctness, using an LLM as a judge to determine if the agent's response adequately addresses the user's question.

Benefits of Open Standards and Collaboration

  • πŸ’¬ Open standards, like OpenTelemetry, create a common language for tools to communicate, simplifying data integration.
  • 🀝 The collaboration between ADK and Arize Phoenix demonstrates how open-source frameworks enable seamless integrations for enhanced debugging and iteration.
  • ✨ This combination allows teams to move from a prototype to a reliable, production-ready AI application.
Knowledge graph26 entities Β· 19 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover Β· drag to explore
26 entities
Chapters5 moments

Key Moments

Transcript38 segments

Full Transcript

Topics14 themes

What’s Discussed

AI AgentsObservabilityArize AIArize PhoenixGoogle ADKAgent Development KitLLMOpenTelemetryTracingDebuggingEvaluationsProduction AINon-deterministic SystemsGenAI
Smart Objects26 Β· 19 links
ProductsΒ· 8
ConceptsΒ· 13
PeopleΒ· 2
CompaniesΒ· 2
MediaΒ· 1