This paper addresses performance degradation when merging Low-Rank Adaptation (LoRA) experts by identifying and pruning "negative modules": LoRA layers that lower the merged model's overall performance. The authors introduce Evolutionary Negative Module Pruning (ENMP), a plug-and-play method that uses an evolutionary search strategy to exclude these detrimental layers before merging. Their evaluations show that ENMP consistently improves the performance of existing merging techniques and achieves a new state of the art on both language and vision tasks.
Pruning detrimental LoRA modules can lead to substantial performance gains in multi-task models, challenging the assumption that all components contribute positively.
Merging multiple Low-Rank Adaptation (LoRA) experts into a single backbone is a promising approach for efficient multi-task deployment. While existing methods strive to alleviate interference via weight interpolation or subspace alignment, they rest upon the implicit assumption that all LoRA matrices contribute constructively to the merged model. In this paper, we uncover a critical bottleneck in current merging paradigms: the existence of "negative modules", that is, specific LoRA layers that inherently degrade global performance upon merging. We propose Evolutionary Negative Module Pruning (ENMP), a plug-and-play LoRA pruning method to locate and exclude these detrimental modules prior to merging. By leveraging an evolutionary search strategy, ENMP effectively navigates the discrete, non-differentiable landscape of module selection to identify optimal pruning configurations. Extensive evaluations demonstrate that ENMP consistently boosts the performance of existing merging algorithms, achieving a new state of the art across both language and vision domains. Code is available at https://github.com/CaoAnda/ENMP-LoRAMerging.
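To make the idea of searching a discrete, non-differentiable space of module selections concrete, here is a minimal sketch of an evolutionary search over binary keep/prune masks. It is not the authors' implementation: the names `evolve_mask`, `toy_fitness`, and `contributions` are hypothetical, and the operators used (elitism, one-point crossover, bit-flip mutation) are generic genetic-algorithm choices assumed for illustration. In a real setting, the fitness function would presumably merge only the retained LoRA modules and score the merged model on validation data; here a toy additive score stands in for that step.

```python
import random
from typing import Callable, Tuple

Mask = Tuple[int, ...]  # 1 = keep the LoRA module, 0 = prune it before merging


def evolve_mask(
    num_modules: int,
    fitness: Callable[[Mask], float],
    population_size: int = 20,
    generations: int = 30,
    mutation_rate: float = 0.1,
    elite: int = 4,
    seed: int = 0,
) -> Tuple[Mask, float]:
    """Generic evolutionary search over keep/prune masks (illustrative only).

    Assumes num_modules >= 2 and 2 <= elite <= population_size.
    """
    rng = random.Random(seed)

    # Seed the population with the all-keep mask (plain merging, no pruning)
    # so the search can never end up worse than the unpruned baseline.
    population = [tuple([1] * num_modules)]
    while len(population) < population_size:
        population.append(tuple(rng.randint(0, 1) for _ in range(num_modules)))

    for _ in range(generations):
        # Rank masks by fitness and keep the best ones unchanged (elitism).
        ranked = sorted(population, key=fitness, reverse=True)
        parents = ranked[:elite]
        children = list(parents)

        while len(children) < population_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, num_modules)  # one-point crossover
            child = list(a[:cut] + b[cut:])
            for i in range(num_modules):
                if rng.random() < mutation_rate:  # bit-flip mutation
                    child[i] = 1 - child[i]
            children.append(tuple(child))

        population = children

    best = max(population, key=fitness)
    return best, fitness(best)


# Hypothetical per-module contributions; negative values play the role of
# "negative modules". A real fitness would evaluate the merged model instead.
contributions = [0.5, -0.3, 0.2, -0.1, 0.4, 0.1, -0.2, 0.3]


def toy_fitness(mask: Mask) -> float:
    return sum(c for c, keep in zip(contributions, mask) if keep)


best_mask, best_score = evolve_mask(len(contributions), toy_fitness)
baseline_score = toy_fitness(tuple([1] * len(contributions)))
print("best mask:", best_mask)
print("score with pruning:", best_score, "vs. no pruning:", baseline_score)
```

Because the all-keep mask is in the initial population and the top masks are carried over unchanged each generation, the returned score cannot fall below the unpruned baseline. Whether the search reaches the best possible mask depends on the population size, mutation rate, and number of generations.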