Search papers, labs, and topics across Lattice.
This paper introduces an attention regulation framework to improve counterfactual chest X-ray (CXR) synthesis using diffusion models. The method uses organ masks to gate self-attention and anatomy-token cross-attention, thereby confining structural interactions to anatomical regions of interest and reducing distortions. Additionally, a pathology-guided module enhances pathology-token cross-attention within target lung regions and performs latent corrections based on attention-concentration energy, improving lesion localization and control. Experiments on CXR datasets demonstrate improved anatomical consistency and more precise pathological edits compared to standard diffusion editing.
Achieve anatomically consistent and controllable counterfactual CXR synthesis by steering diffusion model attention with organ masks and pathology guidance.
Counterfactual generation for chest X-rays (CXR) aims to simulate plausible pathological changes while preserving patient-specific anatomy. However, diffusion-based editing methods often suffer from structural drift, where stable anatomical semantics propagate globally through attention and distort non-target regions, and unstable pathology expression, since subtle and localized lesions induce weak and noisy conditioning signals. We present an inference-time attention regulation framework for reliable counterfactual CXR synthesis. An anatomy-aware attention regularization module gates self-attention and anatomy-token cross-attention with organ masks, confining structural interactions to anatomical ROIs and reducing unintended distortions. A pathology-guided module enhances pathology-token cross-attention within target lung regions during early denoising and performs lightweight latent corrections driven by an attention-concentration energy, enabling controllable lesion localization and extent. Extensive evaluations on CXR datasets show improved anatomical consistency and more precise, controllable pathological edits compared with standard diffusion editing, supporting localized counterfactual analysis and data augmentation for downstream tasks.