Search papers, labs, and topics across Lattice.
This paper introduces Cross-Domain Feature Knowledge Distillation (CD-FKD) to improve single-domain generalization in object detection by training a student network on diversified (downscaled and corrupted) data to mimic the features of a teacher network trained on original source domain data. CD-FKD employs both global and instance-wise feature distillation to enable the student to extract robust object-centric features. Experiments show CD-FKD outperforms state-of-the-art methods in target domain generalization and source domain performance, demonstrating improved robustness to domain shifts.
Object detectors can be made significantly more robust to domain shifts by distilling knowledge from a teacher network trained on clean data to a student trained on downscaled and corrupted versions of the same data.
Single-domain generalization is essential for object detection, particularly when training models on a single source domain and evaluating them on unseen target domains. Domain shifts, such as changes in weather, lighting, or scene conditions, pose significant challenges to the generalization ability of existing models. To address this, we propose Cross-Domain Feature Knowledge Distillation (CD-FKD), which enhances the generalization capability of the student network by leveraging both global and instance-wise feature distillation. The proposed method uses diversified data through downscaling and corruption to train the student network, whereas the teacher network receives the original source domain data. The student network mimics the features of the teacher through both global and instance-wise distillation, enabling it to extract object-centric features effectively, even for objects that are difficult to detect owing to corruption. Extensive experiments on challenging scenes demonstrate that CD-FKD outperforms state-of-the-art methods in both target domain generalization and source domain performance, validating its effectiveness in improving object detection robustness to domain shifts. This approach is valuable in real-world applications, like autonomous driving and surveillance, where robust object detection in diverse environments is crucial.