Mar 16, 2026 · arXiv:2603.14688

AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems

AI Summary

AgentTrace is introduced as a causal tracing framework for diagnosing failures in deployed multi-agent systems by reconstructing causal graphs from execution logs. It identifies root causes by tracing backward from error manifestations and ranking candidates using structural and positional signals. Experiments on a benchmark of multi-agent failure scenarios show that AgentTrace achieves high accuracy and sub-second latency in localizing root causes, outperforming heuristic and LLM-based baselines.

Key Contribution

AgentTrace localizes the root causes of multi-agent system failures with sub-second latency and without LLM inference at debugging time, outperforming both heuristic and LLM-based baselines.

Abstract

As multi-agent AI systems are increasingly deployed in real-world settings - from automated customer support to DevOps remediation - failures become harder to diagnose due to cascading effects, hidden dependencies, and long execution traces. We present AgentTrace, a lightweight causal tracing framework for post-hoc failure diagnosis in deployed multi-agent workflows. AgentTrace reconstructs causal graphs from execution logs, traces backward from error manifestations, and ranks candidate root causes using interpretable structural and positional signals - without requiring LLM inference at debugging time. Across a diverse benchmark of multi-agent failure scenarios designed to reflect common deployment patterns, AgentTrace localizes root causes with high accuracy and sub-second latency, significantly outperforming both heuristic and LLM-based baselines. Our results suggest that causal tracing provides a practical foundation for improving the reliability and trustworthiness of agentic systems in the wild.
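The pipeline described in the abstract (reconstruct a causal graph from logs, trace backward from the error, rank candidates) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Event` record, its `parents` field, and the two scoring signals (hop distance from the error as a structural signal, earliness in the trace as a positional signal) are hypothetical and are not taken from the paper, which does not specify its log schema or ranking formula here.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Event:
    """Hypothetical log record; the paper's real log schema is not given."""
    event_id: str
    agent: str
    step: int  # position of the event in the execution trace
    parents: list[str] = field(default_factory=list)  # events whose output this one consumed


def trace_back(events: list[Event], error_id: str) -> dict[str, int]:
    """Breadth-first walk over parent links; returns hop distance from the error."""
    by_id = {e.event_id: e for e in events}
    dist = {error_id: 0}
    queue = deque([error_id])
    while queue:
        current = queue.popleft()
        for parent in by_id[current].parents:
            if parent in by_id and parent not in dist:
                dist[parent] = dist[current] + 1
                queue.append(parent)
    return dist


def rank_root_causes(events: list[Event], error_id: str) -> list[tuple[str, float]]:
    """Rank causal ancestors of the error, highest score first.

    Illustrative scoring only (not the paper's formula):
    structural = normalized hop distance from the error,
    positional = how early the event occurred in the trace.
    """
    by_id = {e.event_id: e for e in events}
    dist = trace_back(events, error_id)
    ancestors = [eid for eid in dist if eid != error_id]
    if not ancestors:
        return []
    max_dist = max(dist[eid] for eid in ancestors)
    error_step = max(by_id[error_id].step, 1)  # avoid division by zero
    scored = []
    for eid in ancestors:
        structural = dist[eid] / max_dist
        positional = 1.0 - by_id[eid].step / error_step
        scored.append((eid, structural + positional))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Calling `rank_root_causes(events, error_id)` on a parsed trace returns only events that are causally upstream of the error, so unrelated activity in the log never enters the candidate list. No model inference is involved, which matches the abstract's claim that diagnosis runs without an LLM at debugging time.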

Interpretability & Mechanistic Interp · Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
