MilaRPIStevensThe Fin AIUniversity of SurreyJun 2, 2026arXiv:2606.03031

AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification

Yan Wang, Xuguang Ai, Jaisal Patel, Xueqing Peng, Fengran Mo, Yupeng Cao, Haohang Li, Mingyu Cao, Lingfei Qian, Víctor Gutiérrez-Basulto

AI Summary

This paper introduces AuditFlow, a graph-grounded multi-agent framework designed to enhance structured financial audit verification by linking reported facts to taxonomy concepts and performing necessary calculations. By utilizing a static US-GAAP taxonomy graph alongside a dynamic XBRL filing graph, the framework enables a structured approach to fact retrieval and rule evaluation, achieving an impressive 82.09% joint audit accuracy with GPT-5.5. The results highlight the critical role of deterministic verification in maintaining accuracy, as removing these checks drastically reduces performance to 17.91%.

Key Contribution

AuditFlow achieves over 82% accuracy in structured financial audits by leveraging a unique symbolic environment that outperforms traditional methods by nearly 15 points.

Abstract

Structured financial audit verification is difficult for language-model agents because correctness depends on structured evidence rather than text alone. A model must link reported facts to taxonomy concepts, traverse calculation or dimensional relations, and recompute expected values before applying an audit rule. We propose AuditFlow, a graph-grounded multi-agent framework that separates adaptive search from deterministic verification. AuditFlow builds a symbolic environment from a static US-GAAP taxonomy graph and a dynamic XBRL filing graph, and exposes it through typed tools for fact retrieval, taxonomy traversal, numerical checking, and rule evaluation. Two junior auditors inspect each case from regulatory and evidentiary views, while a senior auditor resolves disagreements and can request further investigation. The final reports are fused through evidential aggregation to produce an audit verdict, expected value, evidence trail, and trustworthiness score. On a FinAuditing-derived FinMR sample, AuditFlow reaches 82.09% joint audit accuracy under GPT-5.5, outperforming the strongest baseline by 14.93 points. Removing deterministic checks drops accuracy to 17.91%, showing that the symbolic environment performs the verification step that the model cannot reliably replace.

Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References29

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification

Related Papers