Mar 18, 2026arXiv:2603.17445

When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution

AI Summary

The paper introduces Implicit Execution Tracing (IET), a framework for attributing token-level contributions in multi-agent language systems when execution logs are unavailable. IET embeds agent-specific keyed signals into the token distribution during generation, creating a self-describing execution trace detectable only with a secret key. Experiments demonstrate that IET accurately recovers agent segments and coordination structure while maintaining generation quality, facilitating privacy-preserving auditing.

Key Contribution

You can now audit multi-agent LLM systems and trace responsibility for harmful outputs even without access to internal execution logs, thanks to a clever "self-describing text" technique.

Abstract

When a multi-agent system produces an incorrect or harmful answer, who is accountable if execution logs and agent identifiers are unavailable? Multi-agent language systems increasingly rely on structured interactions such as delegation and iterative refinement, yet the final output often obscures the underlying interaction topology and agent contributions. We introduce IET (Implicit Execution Tracing), a metadata-independent framework that enables token-level attribution directly from generated text and a simple mechanism for interaction topology reconstruction. During generation, agent-specific keyed signals are embedded into the token distribution, transforming the text into a self-describing execution trace detectable only with a secret key. At detection time, a transition-aware scoring method identifies agent handover points and reconstructs the interaction graph. Experiments show that IET recovers agent segments and coordination structure with high accuracy while preserving generation quality, enabling privacy-preserving auditing for multi-agent language systems.

Interpretability & Mechanistic Interp Natural Language Processing Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution

Related Papers