Feb 18, 2026arXiv:2602.16531

Transfer Learning of Linear Regression with Multiple Pretrained Models: Benefiting from More Pretrained Models via Overparameterization Debiasing

AI Summary

This paper investigates transfer learning for linear regression using multiple potentially overparameterized least-squares pretrained models, formulating the target learning task as a penalized optimization problem minimizing squared errors on the target dataset. The authors derive an analytical expression for the test error of the learned target model, demonstrating that using more pretrained models improves transfer learning, especially when debiasing overparameterized models. They propose a multiplicative correction factor to mitigate overparameterization bias, enabling the effective use of more pretrained models.

Key Contribution

Overparameterized pretrained models can hurt transfer learning in linear regression, but a simple debiasing technique unlocks the benefits of using many such models.

Abstract

We study transfer learning for a linear regression task using several least-squares pretrained models that can be overparameterized. We formulate the target learning task as optimization that minimizes squared errors on the target dataset with penalty on the distance of the learned model from the pretrained models. We analytically formulate the test error of the learned target model and provide the corresponding empirical evaluations. Our results elucidate when using more pretrained models can improve transfer learning. Specifically, if the pretrained models are overparameterized, using sufficiently many of them is important for beneficial transfer learning. However, the learning may be compromised by overparameterization bias of pretrained models, i.e., the minimum $\ell_2$-norm solution's restriction to a small subspace spanned by the training examples in the high-dimensional parameter space. We propose a simple debiasing via multiplicative correction factor that can reduce the overparameterization bias and leverage more pretrained models to learn a target predictor.

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Transfer Learning of Linear Regression with Multiple Pretrained Models: Benefiting from More Pretrained Models via Overparameterization Debiasing

Related Papers