This paper introduces SplitFT, a federated split learning system designed to address the challenges of fine-tuning large language models (LLMs) in heterogeneous client environments. SplitFT allows clients to adaptively set the cut layer based on their computational resources and model performance, and reduces communication overhead by lowering the LoRA rank at the cut layer. Experiments using a length-based Dirichlet data splitting approach demonstrate that SplitFT achieves superior fine-tuning efficiency and model performance compared to existing methods.
Fine-tuning LLMs in federated settings just got easier: SplitFT lets clients adapt their cut layers and LoRA ranks, boosting performance and slashing communication costs.
Federated split learning has emerged as an efficient way to accommodate the computational resource constraints of clients in classical federated learning while preserving data privacy during distributed model training across data owners. However, this training strategy faces critical challenges when applied to fine-tuning large language models (LLMs). One challenge is setting the cut layer adaptively across different clients to handle data and device heterogeneity, which significantly affects system performance. Another is efficiently reducing the communication overhead incurred during fine-tuning. To the best of our knowledge, no existing work addresses these challenges. To bridge this gap, we propose SplitFT, an adaptive federated split learning system for LLM fine-tuning. SplitFT enables each client to choose its own cut layer according to its computational resources and the performance of the trained model, and it reduces communication overhead by lowering the LoRA rank at the cut layer. In addition, to simulate the heterogeneous data of real-world applications in our federated split learning system, we propose a length-based Dirichlet approach that partitions the training data across clients. Extensive experimental results show that our approach outperforms the state-of-the-art in fine-tuning time efficiency and model performance on various popular benchmarks.
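The length-based Dirichlet partitioning described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes samples are first bucketed into quantile bins by sequence length, and that each bin is then shared among clients according to a Dirichlet(alpha) draw, so smaller alpha yields more skewed (more heterogeneous) length distributions per client. The function name, binning scheme, and parameters are hypothetical.

```python
import numpy as np

def length_dirichlet_split(lengths, num_clients, alpha=0.5, num_bins=4, seed=0):
    """Hypothetical sketch: partition sample indices across clients so that
    each sequence-length bin is allotted by a Dirichlet(alpha) draw."""
    rng = np.random.default_rng(seed)
    lengths = np.asarray(lengths)
    # Bucket samples into quantile bins by sequence length.
    edges = np.quantile(lengths, np.linspace(0, 1, num_bins + 1)[1:-1])
    bins = np.digitize(lengths, edges)
    client_indices = [[] for _ in range(num_clients)]
    for b in range(num_bins):
        idx = np.flatnonzero(bins == b)
        rng.shuffle(idx)
        # Dirichlet proportions decide how this bin is shared among clients;
        # small alpha concentrates a bin on few clients (high heterogeneity).
        props = rng.dirichlet(alpha * np.ones(num_clients))
        cuts = (np.cumsum(props)[:-1] * len(idx)).astype(int)
        for client, part in enumerate(np.split(idx, cuts)):
            client_indices[client].extend(part.tolist())
    return client_indices
```

Every sample is assigned to exactly one client, so the client shards form a disjoint cover of the dataset; varying `alpha` controls how non-IID the per-client length distributions are.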