Feb 23, 2026arXiv:2602.19926

Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models

AI Summary

The paper addresses the performance degradation of Low-Rank Adaptation (LoRA) in differentially private federated learning (DPFL) for large vision and language models, identifying gradient coupling, noise amplification, and model sharpness as key challenges. To mitigate these issues, they propose Local Alternating LoRA (LA-LoRA), which decouples gradient interactions and aligns update directions across clients. Empirical results on Swin Transformer and RoBERTa models demonstrate that LA-LoRA achieves state-of-the-art performance under strict privacy constraints, significantly outperforming existing methods.

Key Contribution

LoRA in differentially private federated learning gets a 16% accuracy boost with LA-LoRA, which mitigates gradient coupling and noise amplification.

Abstract

Fine-tuning large vision models (LVMs) and large language models (LLMs) under differentially private federated learning (DPFL) is hindered by a fundamental privacy-utility trade-off. Low-Rank Adaptation (LoRA), a promising parameter-efficient fine-tuning (PEFT) method, reduces computational and communication costs by introducing two trainable low-rank matrices while freezing pre-trained weights. However, directly applying LoRA in DPFL settings leads to performance degradation, especially in LVMs. Our analysis reveals three previously underexplored challenges: (1) gradient coupling caused by the simultaneous update of two asymmetric low-rank matrices, (2) compounded noise amplification under differential privacy, and (3) sharpness of the global aggregated model in the parameter space. To address these issues, we propose LA-LoRA (\textbf{L}ocal \textbf{A}lternating \textbf{LoRA}), a novel approach that decouples gradient interactions and aligns update directions across clients to enhance robustness under stringent privacy constraints. Theoretically, LA-LoRA strengthens convergence guarantees in noisy federated environments. Extensive experiments demonstrate that LA-LoRA achieves state-of-the-art (SOTA) performance on Swin Transformer and RoBERTa models, showcasing robustness to DP noise and broad applicability across both LVMs and LLMs. For example, when fine-tuning the Swin-B model on the Tiny-ImageNet dataset under a strict privacy budget ($ε= 1$), LA-LoRA outperforms the best baseline, RoLoRA, by 16.83\% in test accuracy. Code is provided in \repolink.

Computer Vision Distributed Systems & Hardware Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models

Related Papers