The paper introduces InfScene-SR, a diffusion-based super-resolution framework that lets a DDPM trained on fixed-size patches run spatially continuous inference on arbitrary-sized images. This matters because large scenes can then be super-resolved without the memory limits of whole-image inference or the boundary artifacts of independent patch processing, both common problems in existing methods. The authors report seamless high-resolution imagery on remote sensing datasets, with improved perceptual quality and benefits for downstream tasks such as semantic segmentation.
Achieve seamless, high-quality super-resolution on arbitrary-sized images with a diffusion model, sidestepping the limitations of fixed-size training and patch-based artifacts.
Image Super-Resolution (SR) aims to recover high-resolution (HR) details from low-resolution (LR) inputs, a task where Denoising Diffusion Probabilistic Models (DDPMs) have recently shown superior performance compared to approaches based on Generative Adversarial Networks (GANs). However, standard diffusion-based SR models, such as SR3, are typically trained on fixed-size patches and struggle to scale to arbitrary-sized images due to memory constraints. Applying these models via independent patch processing leads to visible seams and inconsistent textures across patch boundaries. In this paper, we propose InfScene-SR, a framework enabling spatially continuous super-resolution for large scenes of arbitrary size. We adapt the iterative refinement process of diffusion models with a novel guided and variance-corrected fusion mechanism, allowing for the seamless generation of large-scale high-resolution imagery without retraining. We validate our approach on remote sensing datasets, demonstrating that InfScene-SR not only reconstructs fine details with high perceptual quality but also eliminates boundary artifacts, benefiting downstream tasks such as semantic segmentation.
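The abstract does not spell out the fusion rule, so the following is only a minimal sketch of why blending overlapping tiles during diffusion sampling may call for a variance correction. It assumes overlapping tiles, a Hann-shaped blending window, and independent unit-variance Gaussian noise per tile. The helper names `tile_starts` and `fuse_noise`, the tile size, and the stride are hypothetical and are not taken from InfScene-SR. A weighted average of independent noise samples with weights w_i has variance (sum of w_i^2) / (sum of w_i)^2, which drops below 1 wherever tiles overlap; dividing the weighted sum by the square root of the sum of w_i^2 instead keeps the variance at 1.

```python
import numpy as np


def tile_starts(length, patch, stride):
    """Start offsets of overlapping tiles that cover [0, length)."""
    if length < patch:
        raise ValueError("image side must be at least one patch long")
    starts = list(range(0, length - patch, stride))
    starts.append(length - patch)  # last tile sits flush with the border
    return starts


def fuse_noise(height, width, patch=64, stride=48, seed=0, correct=True):
    """Blend per-tile Gaussian noise into one field (illustrative only).

    Each tile draws its own noise, standing in for the stochastic term of
    one reverse-diffusion step computed independently per tile.
    """
    rng = np.random.default_rng(seed)
    # Assumed blending window: Hann ramp with the zero endpoints dropped,
    # so every weight is strictly positive.
    ramp = np.hanning(patch + 2)[1:-1]
    window = np.outer(ramp, ramp)

    acc = np.zeros((height, width))    # sum of w_i * eps_i
    w_sum = np.zeros((height, width))  # sum of w_i
    w_sq = np.zeros((height, width))   # sum of w_i^2

    for y in tile_starts(height, patch, stride):
        for x in tile_starts(width, patch, stride):
            eps = rng.standard_normal((patch, patch))
            acc[y:y + patch, x:x + patch] += window * eps
            w_sum[y:y + patch, x:x + patch] += window
            w_sq[y:y + patch, x:x + patch] += window ** 2

    if correct:
        # Variance-preserving normalisation: Var = sum(w^2) / sum(w^2) = 1.
        return acc / np.sqrt(w_sq)
    # Plain weighted average: Var = sum(w^2) / sum(w)^2 <= 1.
    return acc / w_sum


if __name__ == "__main__":
    plain = fuse_noise(256, 256, correct=False)
    fixed = fuse_noise(256, 256, correct=True)
    print("std, plain average:     ", plain.std())
    print("std, variance-corrected:", fixed.std())
```

Under these assumptions the plain average is under-dispersed in the overlap bands, which in a diffusion sampler would show up as smoother texture along tile boundaries, while the rescaled version keeps the noise statistics uniform across the scene. The guidance component mentioned in the abstract is not modelled here.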