JiangnanMar 5, 2026arXiv:2603.04839

Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction

Yuanbo Li, Tianyang Xu, Cong Hu, Tao Zhou, Xiao-jun Wu, J. Kittler, Josef Kittler

AI Summary

The paper introduces Semantic-Augmented Dynamic Contrastive Attack (SADCA), a novel method for generating transferable adversarial examples against vision-language pre-training (VLP) models. SADCA dynamically disrupts cross-modal alignment by establishing a contrastive learning mechanism between adversarial, positive, and negative samples, thereby reinforcing semantic inconsistency. Experiments demonstrate that SADCA significantly improves adversarial transferability across multiple datasets and models compared to existing attack methods by also leveraging input transformations for semantic augmentation.

Key Contribution

Existing vision-language attacks are weak, but this new method, SADCA, uses dynamic contrastive learning and semantic augmentation to create adversarial examples that transfer much more effectively across models and datasets.

Abstract

With the rapid advancement and widespread application of vision-language pre-training (VLP) models, their vulnerability to adversarial attacks has become a critical concern. In general, the adversarial examples can typically be designed to exhibit transferable power, attacking not only different models but also across diverse tasks. However, existing attacks on language-vision models mainly rely on static cross-modal interactions and focus solely on disrupting positive image-text pairs, resulting in limited cross-modal disruption and poor transferability. To address this issue, we propose a Semantic-Augmented Dynamic Contrastive Attack (SADCA) that enhances adversarial transferability through progressive and semantically guided perturbation. SADCA progressively disrupts cross-modal alignment through dynamic interactions between adversarial images and texts. This is accomplished by SADCA establishing a contrastive learning mechanism involving adversarial, positive and negative samples, to reinforce the semantic inconsistency of the obtained perturbations. Moreover, we empirically find that input transformations commonly used in traditional transfer-based attacks also benefit VLPs, which motivates a semantic augmentation module that increases the diversity and generalization of adversarial examples. Extensive experiments on multiple datasets and models demonstrate that SADCA significantly improves adversarial transferability and consistently surpasses state-of-the-art methods. The code is released at https://github.com/LiYuanBoJNU/SADCA.

Computer Vision Multimodal Models Red-Teaming & Adversarial Robustness

Citation Metrics

Citations0

Influential citations0

References44

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Towards Highly Transferable Vision-Language Attack via Semantic-Augmented Dynamic Contrastive Interaction

Related Papers