Search papers, labs, and topics across Lattice.
This paper introduces YCDa, a novel early-stage feature processing strategy for real-time camouflaged object detection inspired by human vision's ability to decouple chrominance and luminance. YCDa separates YCbCr color channels and applies dynamic channel attention to emphasize luminance and suppress misleading color cues. Integrating YCDa into existing detectors by replacing the first downsampling layer significantly improves performance with minimal overhead, achieving state-of-the-art results on COD-D datasets.
Mimicking human vision's ability to ignore misleading colors, YCDa boosts camouflaged object detection by 112% with a simple, plug-and-play module.
Human vision exhibits remarkable adaptability in perceiving objects under camouflage. When color cues become unreliable, the visual system instinctively shifts its reliance from chrominance (color) to luminance (brightness and texture), enabling more robust perception in visually confusing environments. Drawing inspiration from this biological mechanism, we propose YCDa, an efficient early-stage feature processing strategy that embeds this "chrominance-luminance decoupling and dynamic attention" principle into modern real-time detectors. Specifically, YCDa separates color and luminance information in the input stage and dynamically allocates attention across channels to amplify discriminative cues while suppressing misleading color noise. The strategy is plug-and-play and can be integrated into existing detectors by simply replacing the first downsampling layer. Extensive experiments on multiple baselines demonstrate that YCDa consistently improves performance with negligible overhead as shown in Fig. Notably, YCDa-YOLO12s achieves a 112% improvement in mAP over the baseline on COD10K-D and sets new state-of-the-art results for real-time camouflaged object detection across COD-D datasets.