Search papers, labs, and topics across Lattice.
This paper introduces GLeVE, a graph-guided framework for grounding radiology reports to 3D CT volumes by treating each lesion description as an atomic semantic unit. GLeVE uses relation-aware graph reasoning to encode organ attribution, attributes, and inter-lesion relations, combined with anatomy-aware proposal generation and octree-based refinement. Experiments on AbdomenAtlas 3.0 show that GLeVE achieves improved segmentation accuracy and lesion-level localization compared to existing methods.
Achieve verifiable clinical interpretation by grounding radiology reports to 3D CT volumes with a novel graph-guided lesion grounding framework that outperforms existing multimodal foundation models.
Grounding radiology report descriptions to 3D CT volumes is essential for verifiable clinical interpretation, yet remains challenging due to the semantic-spatial gap between free-text narratives and volumetric anatomy. Existing report-assisted and vision-language grounding methods typically rely on phrase-level alignment or dense pixel supervision, resulting in limited lesion-wise correspondence and suboptimal localization accuracy. We propose GLeVE, a graph-guided lesion grounding framework with anatomical prior verification and octree-based autoregressive refinement. GLeVE treats each lesion description as an atomic semantic unit and encodes organ attribution, attributes, and inter-lesion relations through relation-aware graph reasoning to produce discriminative lesion-wise queries. Anatomy-aware proposal generation with region-level verification enforces one-to-one text-lesion alignment, while hierarchical octree refinement progressively improves boundary delineation. Experiments on AbdomenAtlas 3.0 demonstrate consistent gains over classical multimodal foundation models and report-supervised baselines in both segmentation accuracy and lesion-level localization.