Search papers, labs, and topics across Lattice.
The paper introduces General Intensity Direction (GID), a preprocessing technique that aligns images to a canonical reference frame based on a computed global orientation, enhancing rotation robustness in downstream tasks. This method addresses the limitations of standard neural networks in handling rotations without requiring architectural modifications or extensive data augmentation. Experiments on rotated MNIST and CIFAR-10 demonstrate that GID improves accuracy compared to rotation-invariant architectures and maintains effectiveness under complex conditions.
Achieve state-of-the-art rotation invariance in image analysis without modifying network architectures by simply preprocessing images with a global orientation alignment.
Handling geometric transformations, particularly rotations, remains a challenge in deep learning for computer vision. Standard neural networks lack inherent rotation invariance and typically rely on data augmentation or architectural modifications to improve robustness. Although effective, these approaches increase computational demands, require specialised implementations, or alter network structures, limiting their applicability. This paper introduces General Intensity Direction (GID), a preprocessing method that improves rotation robustness without modifying the network architecture. The method estimates a global orientation for each image and aligns it to a canonical reference frame, allowing standard models to process inputs more consistently across different rotations. Unlike moment-based approaches that extract invariant descriptors, this method directly transforms the image while preserving spatial structure, making it compatible with convolutional networks. Experimental evaluation on the rotated MNIST dataset shows that the proposed method achieves higher accuracy than state-of-the-art rotation-invariant architectures. Additional experiments on the CIFAR-10 dataset, confirm that the method remains effective under more complex conditions.