Search papers, labs, and topics across Lattice.
This paper introduces ORSIFlow, a saliency-guided rectified flow framework for salient object detection in optical remote sensing images. It leverages a frozen variational autoencoder to perform saliency mask generation in a compact latent space, enabling efficient inference. The framework incorporates a Salient Feature Discriminator for global semantic discrimination and a Salient Feature Calibrator for precise boundary refinement, achieving state-of-the-art performance with improved efficiency.
Saliency detection in remote sensing gets a huge speed boost with ORSIFlow, a rectified flow approach that leaps ahead of slower diffusion-based methods.
Optical Remote Sensing Image Salient Object Detection (ORSI-SOD) remains challenging due to complex backgrounds, low contrast, irregular object shapes, and large variations in object scale. Existing discriminative methods directly regress saliency maps, while recent diffusion-based generative approaches suffer from stochastic sampling and high computational cost. In this paper, we propose ORSIFlow, a saliency-guided rectified flow framework that reformulates ORSI-SOD as a deterministic latent flow generation problem. ORSIFlow performs saliency mask generation in a compact latent space constructed by a frozen variational autoencoder, enabling efficient inference with only a few steps. To enhance saliency awareness, we design a Salient Feature Discriminator for global semantic discrimination and a Salient Feature Calibrator for precise boundary refinement. Extensive experiments on multiple public benchmarks show that ORSIFlow achieves state-of-the-art performance with significantly improved efficiency. Codes are available at: https://github.com/Ch3nSir/ORSIFlow.