SydneyMay 6, 2026arXiv:2605.04530

SADE: Symptom-Aware Diagnostic Escalation for LLM-Based Network Troubleshooting

Kuan-Hao Tseng, Niruth Bogahawatta, Yasod Ginige, Kosta Dekic, Arunan Sivanathan, Suranga Seneviratne

AI Summary

The paper introduces SADE, a novel LLM agent for network troubleshooting that explicitly encodes the Cisco troubleshooting methodology. SADE employs a phase-gated diagnostic workflow, separating evidence acquisition from hypothesis commitment, and leverages a routed library of fault-family skills. Results on the NIKA benchmark show SADE achieves a 37 percentage point improvement in root-cause F1 score compared to a ReAct + GPT-4 baseline, with 22 points attributed to the diagnostic policy itself.

Key Contribution

LLMs can leapfrog current network troubleshooting benchmarks by explicitly encoding structured diagnostic policies, rather than relying on free-form deliberation.

Abstract

Large language model (LLM) agents are increasingly applied to network troubleshooting, but root-cause localization on public benchmarks remains well below practical deployment thresholds. We argue this is because existing agents do not encode the disciplined, layer-by-layer methodology that human network engineers use, and instead rely on free-form deliberation that conflates evidence acquisition with hypothesis commitment. We present SADE (Symptom-Aware Diagnostic Escalation), an agent that encodes the classical Cisco troubleshooting methodology as an explicit policy. SADE pairs a phase-gated diagnostic workflow, which separates evidence acquisition from hypothesis commitment, with a routed library of fault-family skills and high-yield diagnostic helpers. On a held-out 523 incident set of the public NIKA benchmark covering eleven unseen scenarios, SADE improves root-cause F1 by 37 percentage points over a ReAct + GPT-5 baseline; a model-controlled comparison against the same Claude Sonnet backend without the SADE policy attributes 22 of those points to the diagnostic policy alone, showing that the gain is not a side-effect of the model upgrade.

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References34

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

SADE: Symptom-Aware Diagnostic Escalation for LLM-Based Network Troubleshooting

Related Papers