Mar 5, 2026arXiv:2603.05354

Exploring the potential and limitations of Model Merging for Multi-Domain Adaptation in ASR

Carlos Carvalho, Francisco Teixeira, T. Rolland, Thomas Rolland, Alberto Abad

AI Summary

This paper explores model merging as a computationally efficient alternative to fine-tuning for multi-domain adaptation in ASR, specifically for European Portuguese. They benchmark 11 merging algorithms and introduce BoostedTSV-M, a novel method that addresses rank collapse via singular-value boosting. The results demonstrate that BoostedTSV-M outperforms full fine-tuning on European Portuguese ASR across 10 domains while maintaining out-of-distribution generalization.

Key Contribution

Model merging can beat full fine-tuning for multi-domain ASR, offering a scalable path to adapt large speech models without prohibitive computational costs.

Abstract

Model merging is a scalable alternative to multi-task training that combines the capabilities of multiple specialised models into a single model. This is particularly attractive for large speech foundation models, which are typically adapted through domain-specific fine-tuning, resulting in multiple customised checkpoints, for which repeating full fine-tuning when new data becomes available is computationally prohibitive. In this work, we study model merging for multi-domain ASR and benchmark 11 merging algorithms for 10 European Portuguese domains, evaluating in-domain accuracy, robustness under distribution shift, as well as English and multilingual performance. We further propose BoostedTSV-M, a new merging algorithm based on TSV-M that mitigates rank collapse via singular-value boosting and improves numerical stability. Overall, our approach outperforms full fine-tuning on European Portuguese while preserving out-of-distribution generalisation in a single model.

Speech & Audio Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References56

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Exploring the potential and limitations of Model Merging for Multi-Domain Adaptation in ASR

Related Papers