Search papers, labs, and topics across Lattice.
This paper introduces ORDERED, a variance reduction technique for unsupervised domain adaptation (UDA) that optimizes the order in which training data is sampled to minimize domain discrepancy estimation error. They formulate the stochastic estimation error of domain discrepancy losses (CORAL and MMD) as a function of data sampling order and propose a practical optimization algorithm to find a better ordering. Experiments on image classification benchmarks demonstrate reduced variance and improved target domain accuracy compared to standard UDA methods.
Training data order matters more than you think: reordering your data can significantly improve unsupervised domain adaptation by reducing variance in domain discrepancy estimates.
Domain shift remains a key challenge in deploying machine learning models to the real world. Unsupervised domain adaptation (UDA) aims to address this by minimising domain discrepancy during training, but the discrepancy estimates suffer from high variance in stochastic settings, which can stifle the theoretical benefits of the method. This paper proposes Optimal Reordering of Data for Error-Reduced Estimation of Discrepancy (ORDERED), a novel unbiased stochastic variance reduction technique which reduces the discrepancy estimation error by optimising the order in which the training data are sampled. We consider two specific domain discrepancy losses (correlation alignment and the maximum mean discrepancy), formulate their stochastic estimation error as a function of the data sampling order, and propose a practical optimisation algorithm. Our simulations demonstrate reduced variance compared to related methods, and experiments on two domain shift image classification benchmarks show improved target domain accuracy.