PKUShanghai Ocean UniversitySuzhou University of Science and TechnologyTongjiMay 26, 2026arXiv:2605.26929

When Muon Optimizer Meets Adversarial Training: A Theoretical and Empirical Study

Jun Yan, Weiquan Huang, Jiankai Zuo, Yujian Mo, Chengliang Wu, Zeming Wei

AI Summary

This paper investigates the use of the Muon optimizer, which orthogonalizes matrix-valued updates, within adversarial training (AT) to improve robustness against adversarial attacks. The authors theoretically demonstrate that Muon imposes a spectral-norm stability ceiling on matrix updates, preventing uncontrolled spectral growth during training. Empirically, Muon demonstrates competitive performance with SGD on CNNs and significantly outperforms AdamW on both CNNs and ViTs across various $\ell_p$ threat models.

Key Contribution

Orthogonalized optimization with the Muon optimizer can significantly boost adversarial robustness, outperforming AdamW on both CNNs and ViTs, highlighting the importance of optimizer geometry in security-critical training.

Abstract

Adversarial training (AT) remains one of the most reliable empirical defenses against adversarial attacks. Its robustness critically depends on how the underlying min-max objective is optimized. In practice, Stochastic Gradient Descent (SGD) optimizer remains the default optimization choice for AT, whereas adaptive optimizers often improve standard training but may yield inferior robustness. Recently, the Muon optimizer, which orthogonalizes matrix-valued updates via an approximate polar decomposition, has achieved notable success in large-scale training at a memory cost comparable to SGD. This raises a security-relevant question: \textit{can orthogonalized optimization improve AT under strong and heterogeneous threat models?} Focusing on this problem, we conduct a comprehensive theoretical and empirical study. Theoretically, we show that Muon imposes a spectral-norm stability ceiling on matrix updates, limiting uncontrolled spectral growth in the training dynamics without explicitly shrinking the learned weights. Empirically, across five architectures and three $\ell_p$ threat models ($\ell_\infty$, $\ell_1$, $\ell_2$) and their union, Muon is competitive with SGD on CNNs and substantially outperforms AdamW on both CNNs and ViTs. These results identify optimizer geometry as a security-relevant factor in adversarial training, while clarifying the empirical regimes in which orthogonalized updates are beneficial. Overall, our findings highlight optimizer design as a security-critical component of AT.

Red-Teaming & Adversarial Robustness Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

When Muon Optimizer Meets Adversarial Training: A Theoretical and Empirical Study

Related Papers