Southwest UFeb 25, 2026arXiv:2602.21849

Meta-FC: Meta-Learning with Feature Consistency for Robust and Generalizable Watermarking

Yuheng Li, Weitong Chen, Chengcheng Zhu, Chunpeng Ge, Di Wu, Guodong Long

AI Summary

The paper introduces Meta-FC, a meta-learning based training strategy for improving the robustness and generalization of deep learning watermarking models. Meta-FC uses a meta-training task constructed from multiple sampled distortions and a held-out distortion for meta-testing, encouraging the model to identify stable neuron activations across distortions. A feature consistency loss is also introduced to promote distortion-invariant representations by ensuring consistent decoded features for the same image under different distortions.

Key Contribution

Randomly throwing distortions at your watermarking model during training? Meta-FC shows meta-learning a better way, boosting robustness by up to 4.71% against combined distortions.

Abstract

Deep learning-based watermarking has made remarkable progress in recent years. To achieve robustness against various distortions, current methods commonly adopt a training strategy where a \underline{\textbf{s}}ingle \underline{\textbf{r}}andom \underline{\textbf{d}}istortion (SRD) is chosen as the noise layer in each training batch. However, the SRD strategy treats distortions independently within each batch, neglecting the inherent relationships among different types of distortions and causing optimization conflicts across batches. As a result, the robustness and generalizability of the watermarking model are limited. To address this issue, we propose a novel training strategy that enhances robustness and generalization via \underline{\textbf{meta}}-learning with \underline{\textbf{f}}eature \underline{\textbf{c}}onsistency (Meta-FC). Specifically, we randomly sample multiple distortions from the noise pool to construct a meta-training task, while holding out one distortion as a simulated ``unknown'' distortion for the meta-testing phase. Through meta-learning, the model is encouraged to identify and utilize neurons that exhibit stable activations across different types of distortions, mitigating the optimization conflicts caused by the random sampling of diverse distortions in each batch. To further promote the transformation of stable activations into distortion-invariant representations, we introduce a feature consistency loss that constrains the decoded features of the same image subjected to different distortions to remain consistent. Extensive experiments demonstrate that, compared to the SRD training strategy, Meta-FC improves the robustness and generalization of various watermarking models by an average of 1.59\%, 4.71\%, and 2.38\% under high-intensity, combined, and unknown distortions.

Computer Vision Red-Teaming & Adversarial Robustness Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Meta-FC: Meta-Learning with Feature Consistency for Robust and Generalizable Watermarking

Related Papers