Feb 26, 2026arXiv:2602.23276

CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays

HyunGyung Lee, Hyungyung Lee, Hangyul Yoon, Hangyul Yoon, Edward Choi, Edward Choi

AI Summary

The paper introduces CXReasonAgent, a diagnostic agent that combines a large language model (LLM) with clinically grounded diagnostic tools to perform evidence-grounded diagnostic reasoning on chest X-rays. This approach addresses limitations of large vision-language models (LVLMs) which often lack faithful grounding in diagnostic evidence and require retraining for new tasks. The authors demonstrate that CXReasonAgent generates more faithfully grounded and verifiable responses compared to LVLMs on CXReasonDial, a new multi-turn dialogue benchmark with 1,946 dialogues across 12 diagnostic tasks.

Key Contribution

Clinically-grounded diagnostic agents that reason about chest X-rays outperform large vision-language models in faithfulness and verifiability, without requiring costly retraining for new tasks.

Abstract

Chest X-ray plays a central role in thoracic diagnosis, and its interpretation inherently requires multi-step, evidence-grounded reasoning. However, large vision-language models (LVLMs) often generate plausible responses that are not faithfully grounded in diagnostic evidence and provide limited visual evidence for verification, while also requiring costly retraining to support new diagnostic tasks, limiting their reliability and adaptability in clinical settings. To address these limitations, we present CXReasonAgent, a diagnostic agent that integrates a large language model (LLM) with clinically grounded diagnostic tools to perform evidence-grounded diagnostic reasoning using image-derived diagnostic and visual evidence. To evaluate these capabilities, we introduce CXReasonDial, a multi-turn dialogue benchmark with 1,946 dialogues across 12 diagnostic tasks, and show that CXReasonAgent produces faithfully grounded responses, enabling more reliable and verifiable diagnostic reasoning than LVLMs. These findings highlight the importance of integrating clinically grounded diagnostic tools, particularly in safety-critical clinical settings.

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References25

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays

Related Papers