Search papers, labs, and topics across Lattice.
This paper analyzes a multi-task learning formulation with misspecified perceptron models to understand the benefits of combining related tasks. It shows that multi-task learning is asymptotically equivalent to single-task learning with additional regularization terms, which improve generalization. Empirically, the study demonstrates that combining multiple tasks postpones and mitigates the double descent phenomenon.
Multi-task learning's generalization boost comes from implicit regularization, effectively postponing the dreaded double descent.
Multi--task learning seeks to improve the generalization error by leveraging the common information shared by multiple related tasks. One challenge in multi--task learning is identifying formulations capable of uncovering the common information shared between different but related tasks. This paper provides a precise asymptotic analysis of a popular multi--task formulation associated with misspecified perceptron learning models. The main contribution of this paper is to precisely determine the reasons behind the benefits gained from combining multiple related tasks. Specifically, we show that combining multiple tasks is asymptotically equivalent to a traditional formulation with additional regularization terms that help improve the generalization performance. Another contribution is to empirically study the impact of combining tasks on the generalization error. In particular, we empirically show that the combination of multiple tasks postpones the double descent phenomenon and can mitigate it asymptotically.