Search papers, labs, and topics across Lattice.
The paper introduces FedVG, a federated learning aggregation strategy that uses a global validation set to mitigate client drift caused by data heterogeneity. FedVG computes layer-wise gradient norms on the validation set to assess each client's generalization ability and derive client-specific scores for adaptive aggregation. Experiments on image datasets demonstrate that FedVG improves performance, especially in heterogeneous settings, and can be integrated with existing FL algorithms.
By weighting client updates based on validation gradient norms, FedVG offers a simple yet effective way to mitigate client drift in federated learning, outperforming volume-based aggregation strategies.
Federated Learning (FL) enables collaborative model training across multiple clients without sharing their private data. However, data heterogeneity across clients leads to client drift, which degrades the overall generalization performance of the model. This effect is further compounded by overemphasis on poorly performing clients. To address this problem, we propose FedVG, a novel gradient-based federated aggregation framework that leverages a global validation set to guide the optimization process. Such a global validation set can be established using readily available public datasets, ensuring accessibility and consistency across clients without compromising privacy. In contrast to conventional approaches that prioritize client dataset volume, FedVG assesses the generalization ability of client models by measuring the magnitude of validation gradients across layers. Specifically, we compute layerwise gradient norms to derive a client-specific score that reflects how much each client needs to adjust for improved generalization on the global validation set, thereby enabling more informed and adaptive federated aggregation. Extensive experiments on both natural and medical image benchmarking datasets, across diverse model architectures, demonstrate that FedVG consistently improves performance, particularly in highly heterogeneous settings. Moreover, FedVG is modular and can be seamlessly integrated with various state-of-the-art FL algorithms, often further improving their results. Our code is available at https://github.com/alinadevkota/FedVG.