Search papers, labs, and topics across Lattice.
The paper introduces AWDiff, a diffusion model for lung ultrasound (LUS) image synthesis that incorporates the a trous wavelet transform to preserve fine-scale structures often lost by standard downsampling in other generative models. By integrating the a trous wavelet transform, AWDiff avoids destructive downsampling, thus better preserving subtle diagnostic cues like B-lines and pleural irregularities. Experiments on a LUS dataset demonstrate that AWDiff achieves lower distortion and higher perceptual quality than existing methods, while also maintaining clinical diversity through semantic conditioning with BioMedCLIP.
By integrating a trous wavelets into a diffusion model, AWDiff generates higher-fidelity lung ultrasound images, preserving crucial diagnostic details often lost in standard diffusion-based augmentation.
Lung ultrasound (LUS) is a safe and portable imaging modality, but the scarcity of data limits the development of machine learning methods for image interpretation and disease monitoring. Existing generative augmentation methods, such as Generative Adversarial Networks (GANs) and diffusion models, often lose subtle diagnostic cues due to resolution reduction, particularly B-lines and pleural irregularities. We propose A trous Wavelet Diffusion (AWDiff), a diffusion based augmentation framework that integrates the a trous wavelet transform to preserve fine-scale structures while avoiding destructive downsampling. In addition, semantic conditioning with BioMedCLIP, a vision language foundation model trained on large scale biomedical corpora, enforces alignment with clinically meaningful labels. On a LUS dataset, AWDiff achieved lower distortion and higher perceptual quality compared to existing methods, demonstrating both structural fidelity and clinical diversity.