Search papers, labs, and topics across Lattice.
This paper introduces a novel training data synthesis method to improve the generalization of homography estimation models across different image modalities. The method generates unaligned image pairs with ground-truth offsets from a single image, using diverse textures and colors while preserving structural information. A new network architecture is also proposed to leverage cross-scale information and decouple color information. Experiments demonstrate improved generalization performance and estimation accuracy compared to existing methods.
Synthetic training data can unlock robust homography estimation across diverse image modalities, even when real paired data is scarce.
Supervised and unsupervised homography estimation methods depend on image pairs tailored to specific modalities to achieve high accuracy. However, their performance deteriorates substantially when applied to unseen modalities. To address this issue, we propose a training data synthesis method that generates unaligned image pairs with ground-truth offsets from a single input image. Our approach renders the image pairs with diverse textures and colors while preserving their structural information. These synthetic data empower the trained model to achieve greater robustness and improved generalization across various domains. Additionally, we design a network to fully leverage cross-scale information and decouple color information from feature representations, thus improving estimation accuracy. Extensive experiments show that our training data synthesis method improves generalization performance. The results also confirm the effectiveness of the proposed network.