This paper introduces the Multi-Frequency Expert Network (MFEN) for visible-infrared person re-identification (VI-ReID). It targets the large modality discrepancy between visible and infrared images, which the authors attribute largely to differing lighting conditions such as light wavelength and light source type. Using a mixture-of-experts design, MFEN modulates multiple frequency bands and adaptively combines them, so that identity-relevant contours and details are extracted while irrelevant lighting and color are excluded. Two further modules, Random Frequency Augmentation (RFA) and Frequency Auxiliary Optimization (FAO), are introduced to better train MFEN. Experiments on three VI-ReID datasets demonstrate the effectiveness of the approach.
MFEN addresses VI-ReID under diverse lighting conditions by adaptively combining multiple frequency bands through a mixture-of-experts design, instead of focusing on a single band or leaving the bands undistinguished.
Visible-infrared person re-identification (VI-ReID) is challenging due to the large modality discrepancy between visible and infrared images. We contend that this discrepancy is largely related to differing lighting conditions, including differences in light wavelength and light source type. Recently, frequency-based VI-ReID approaches have achieved notable success because frequency information can better extract identity-relevant contours and details while excluding irrelevant lighting and color. However, existing methods either do not distinguish different frequency bands or focus on only one band, which is insufficient under diverse lighting conditions. To perform comprehensive frequency domain learning, we propose a Multi-Frequency Expert Network (MFEN) that enables multi-frequency modulation and adaptively combines different bands through a mixture-of-experts design. We further introduce Random Frequency Augmentation (RFA) and Frequency Auxiliary Optimization (FAO) to better train MFEN. The three modules are complementary and jointly capture critical frequency-domain details for robust representation learning. Extensive experiments on three VI-ReID datasets demonstrate the effectiveness of our approach.
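To make the idea of adaptively combining frequency bands concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify how bands are defined or how the experts and gate are built. The sketch assumes concentric radial bands in the 2-D Fourier plane, treats each band-limited image as the output of one "expert", and uses a hypothetical linear gate on per-band energy followed by a softmax. All function names and the `gate_weights` parameter are illustrative.

```python
import numpy as np


def radial_band_masks(height, width, num_bands):
    """Partition the 2-D frequency plane into concentric radial bands."""
    fy = np.fft.fftfreq(height)[:, None]
    fx = np.fft.fftfreq(width)[None, :]
    radius = np.sqrt(fx ** 2 + fy ** 2)
    edges = np.linspace(0.0, radius.max(), num_bands + 1)
    masks = []
    for i in range(num_bands):
        if i == num_bands - 1:
            upper = radius <= edges[i + 1]  # last band includes the outer edge
        else:
            upper = radius < edges[i + 1]
        masks.append((radius >= edges[i]) & upper)
    return np.stack(masks)  # shape: (num_bands, height, width)


def split_into_bands(image, masks):
    """Return one band-limited copy of the image per mask."""
    spectrum = np.fft.fft2(image)
    return np.real(np.fft.ifft2(spectrum[None] * masks, axes=(-2, -1)))


def softmax(scores):
    shifted = scores - scores.max()  # subtract the max for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum()


def combine_bands(image, gate_weights, num_bands=3):
    """Mix band-limited images with input-dependent gate values.

    Assumption: the gate is a linear map applied to per-band mean absolute
    energy. A trained model would learn both the experts and the gate.
    """
    masks = radial_band_masks(image.shape[0], image.shape[1], num_bands)
    bands = split_into_bands(image, masks)
    energy = np.abs(bands).mean(axis=(1, 2))
    gates = softmax(gate_weights @ energy)
    mixed = np.tensordot(gates, bands, axes=1)  # weighted sum over bands
    return mixed, gates, bands


rng = np.random.default_rng(0)
image = rng.standard_normal((32, 16))  # stand-in for a single-channel image
gate_weights = np.zeros((3, 3))        # zero weights give a uniform gate
mixed, gates, bands = combine_bands(image, gate_weights)
```

Because the masks partition the frequency plane, the band-limited images sum back to the input, so the gate only reweights information that is already present. With the zero `gate_weights` used here, every band receives the same weight; a learned gate would instead emphasize different bands for different inputs.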