Mar 29, 2026 · arXiv:2603.27820

Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning

Zhiwen You, Aniket Vashishtha, Simo Du, Gabriel Erion-Barner, Hongyuan Mei, Hao Peng, Yue Guo

AI Summary

This paper introduces a counterfactual multi-agent framework for clinical diagnosis, inspired by how clinicians use counterfactual reasoning to refine diagnoses. The framework uses counterfactual case editing to modify clinical findings and the Counterfactual Probability Gap to quantify the support individual findings lend to a diagnosis. Experiments across diagnostic benchmarks and LLMs show the framework improves diagnostic accuracy and produces more clinically useful reasoning compared to baselines.

Key Contribution

LLMs can diagnose more accurately when they explicitly reason through "what if" scenarios, as clinicians are trained to do.

Abstract

Clinical diagnosis is a complex reasoning process in which clinicians gather evidence, form hypotheses, and test them against alternative explanations. In medical training, this reasoning is explicitly developed through counterfactual questioning--e.g., asking how a diagnosis would change if a key symptom were absent or altered--to strengthen differential diagnosis skills. As large language model (LLM)-based systems are increasingly used for diagnostic support, ensuring the interpretability of their recommendations becomes critical. However, most existing LLM-based diagnostic agents reason over fixed clinical evidence without explicitly testing how individual findings support or weaken competing diagnoses. In this work, we propose a counterfactual multi-agent diagnostic framework inspired by clinician training that makes hypothesis testing explicit and evidence-grounded. Our framework introduces counterfactual case editing to modify clinical findings and evaluate how these changes affect competing diagnoses. We further define the Counterfactual Probability Gap, a method that quantifies how strongly individual findings support a diagnosis by measuring confidence shifts under these edits. These counterfactual signals guide multi-round specialist discussions, enabling agents to challenge unsupported hypotheses, refine differential diagnoses, and produce more interpretable reasoning trajectories. Across three diagnostic benchmarks and seven LLMs, our method consistently improves diagnostic accuracy over prompting and prior multi-agent baselines, with the largest gains observed in complex and ambiguous cases. Human evaluation further indicates that our framework produces more clinically useful, reliable, and coherent reasoning. These results suggest that incorporating counterfactual evidence verification is an important step toward building reliable AI systems for clinical decision support.
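The abstract describes the Counterfactual Probability Gap only in words: edit a finding, then measure how the confidence in a diagnosis shifts. The sketch below illustrates one plausible reading of that idea, assuming the gap is the confidence on the original case minus the confidence after a single finding is removed. The paper's actual definition may differ. The names `probability_gap`, `rank_findings`, and `toy_score`, as well as the weights in `TOY_WEIGHTS`, are hypothetical and exist only for illustration; in practice the scorer would be an LLM's confidence in a diagnosis, not a lookup table.

```python
from typing import Callable, Dict, List

# Hypothetical interface: a scorer maps a list of clinical findings to a
# confidence in [0, 1] for one fixed candidate diagnosis.
Scorer = Callable[[List[str]], float]


def remove_finding(findings: List[str], target: str) -> List[str]:
    """Counterfactual edit (assumed form): drop one finding from the case."""
    return [f for f in findings if f != target]


def probability_gap(score: Scorer, findings: List[str], target: str) -> float:
    """Confidence on the original case minus confidence on the edited case.

    A large positive gap suggests the finding supports the diagnosis;
    a gap near zero suggests the diagnosis does not depend on it.
    """
    return score(findings) - score(remove_finding(findings, target))


def rank_findings(score: Scorer, findings: List[str]) -> Dict[str, float]:
    """Order findings from most to least supportive under this reading."""
    gaps = {f: probability_gap(score, findings, f) for f in findings}
    return dict(sorted(gaps.items(), key=lambda kv: kv[1], reverse=True))


# Made-up weights standing in for a model's confidence; not clinical data.
TOY_WEIGHTS = {"chest pain": 0.25, "ST elevation": 0.5, "cough": 0.0}


def toy_score(findings: List[str]) -> float:
    """Toy scorer: capped sum of the illustrative weights."""
    return min(1.0, sum(TOY_WEIGHTS.get(f, 0.0) for f in findings))


if __name__ == "__main__":
    case = ["chest pain", "ST elevation", "cough"]
    for finding, gap in rank_findings(toy_score, case).items():
        print(f"{finding}: {gap:.2f}")
```

Under this reading, a finding with a gap near zero is one an agent could challenge as weak support for the hypothesis, which is the role the abstract assigns to these signals in the multi-round discussion.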

Natural Language Processing · Reasoning & Chain-of-Thought · Scientific Discovery & Drug Design

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
