Anhui Province Key Laboratory of Digital SecurityCUHKUSTCJun 10, 2026arXiv:2606.12051

MFEN:Multi-Frequency Expert Network for Visible-Infrared Person Re-ID

Xulin Li, Yan Lu, Bin Liu, Qinhong Yang, Qi Chu, Tao Gong, Nenghai Yu

AI Summary

This paper introduces the Multi-Frequency Expert Network (MFEN) for visible-infrared person re-identification (VI-ReID), addressing the significant modality discrepancy caused by varying lighting conditions. By employing a mixture-of-experts design, MFEN adaptively combines multiple frequency bands, enhancing the model's ability to extract identity-relevant features while mitigating the impact of irrelevant lighting variations. Experimental results across three datasets validate that MFEN outperforms existing methods, showcasing its robustness in diverse conditions through complementary modules like Random Frequency Augmentation and Frequency Auxiliary Optimization.

Key Contribution

MFEN achieves superior VI-ReID performance by leveraging a multi-frequency approach that adapts to varying lighting conditions, outperforming traditional single-band methods.

Abstract

Visible-infrared person re-identification (VI-ReID) is challenging due to the large modality discrepancy between visible and infrared images. We contend that this discrepancy is largely related to differing lighting conditions, including differences in light wavelength and light source type. Recently, frequency-based VI-ReID approaches have achieved notable success because frequency information can better extract identity-relevant contours and details while excluding irrelevant lighting and color. However, existing methods either do not distinguish different frequency bands or focus on only one band, which is insufficient under diverse lighting conditions. To perform comprehensive frequency domain learning, we propose a Multi-Frequency Expert Network (MFEN) that enables multi-frequency modulation and adaptively combines different bands through a mixture-of-experts design. We further introduce Random Frequency Augmentation (RFA) and Frequency Auxiliary Optimization (FAO) to better train MFEN. The three modules are complementary and jointly capture critical frequency-domain details for robust representation learning. Extensive experiments on three VI-ReID datasets demonstrate the effectiveness of our approach.

Computer Vision Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

MFEN:Multi-Frequency Expert Network for Visible-Infrared Person Re-ID

Related Papers