Search papers, labs, and topics across Lattice.
This paper introduces a three-stage framework for training machine learning surrogates for optimization problems, using inexpensive, imperfect labels for pretraining followed by self-supervised refinement. The approach leverages a theoretical analysis showing that labels only need to place the model within a basin of attraction, reducing the need for high-quality, expensive data. Experiments across nonconvex optimization, power-grid operation, and dynamical systems demonstrate faster convergence, improved accuracy, and significant reductions in offline cost compared to existing methods.
Unlock up to 59x cost reductions in optimization by pretraining ML surrogates with cheap, imperfect labels and then refining them with self-supervision.
To scale the solution of optimization and simulation problems, prior work has explored machine-learning surrogates that inexpensively map problem parameters to corresponding solutions. Commonly used approaches, including supervised and self-supervised learning with either soft or hard feasibility enforcement, face inherent challenges such as reliance on expensive, high-quality labels or difficult optimization landscapes. To address their trade-offs, we propose a novel framework that first collects"cheap"imperfect labels, then performs supervised pretraining, and finally refines the model through self-supervised learning to improve overall performance. Our theoretical analysis and merit-based criterion show that labeled data need only place the model within a basin of attraction, confirming that only modest numbers of inexact labels and training epochs are required. We empirically validate our simple three-stage strategy across challenging domains, including nonconvex constrained optimization, power-grid operation, and stiff dynamical systems, and show that it yields faster convergence; improved accuracy, feasibility, and optimality; and up to 59x reductions in total offline cost.