Search papers, labs, and topics across Lattice.
This paper introduces PCDC, a novel remote sensing image compression framework that combines a conditional diffusion decoder with a PPO-based block-wise bitrate allocation strategy. The PPO agent learns to optimize bitrate allocation across image blocks, enabling high compression ratios while preserving perceptual quality and task-relevant information. Experiments on DIV2K and a newly released high-resolution drone image dataset demonstrate compression ratios exceeding 19x and negligible performance loss in downstream object detection tasks.
Achieve >19x compression on high-resolution drone imagery without sacrificing object detection performance by intelligently allocating bitrates with a PPO-trained agent guiding a conditional diffusion model.
Existing remote sensing image compression methods still explore to balance high compression efficiency with the preservation of fine details and task-relevant information. Meanwhile, high-resolution drone imagery offers valuable structural details for urban monitoring and disaster assessment, but large-area datasets can easily reach hundreds of gigabytes, creating significant challenges for storage and long-term management. In this paper, we propose a PPO-based bitrate allocation Conditional Diffusion Compression (PCDC) framework. PCDC integrates a conditional diffusion decoder with a PPO-based block-wise bitrate allocation strategy to achieve high compression ratios while maintaining strong perceptual performance. We also release a high-resolution drone image dataset with richer structural details at a consistent low altitude over residential neighborhoods in coastal urban areas. Experimental results show compression ratios of 19.3x on DIV2K and 21.2x on the drone image dataset. Moreover, downstream object detection experiments demonstrate that the reconstructed images preserve task-relevant information with negligible performance loss.