Search papers, labs, and topics across Lattice.
This paper introduces Hyperparameter Trajectory Inference (HTI), a novel task aimed at learning how a neural network's conditional output distribution changes with its hyperparameters to create a surrogate model, avoiding expensive retraining when user preferences evolve post-deployment. The authors propose a conditional Lagrangian optimal transport approach that jointly learns the Lagrangian function governing hyperparameter dynamics and the associated optimal transport maps and geodesics. By incorporating manifold hypothesis and least-action principles into the learned Lagrangian, the method improves surrogate model feasibility and outperforms alternatives in reconstructing NN outputs across hyperparameter spectra.
Imagine evolving user preferences post-deployment without expensive retraining: this paper offers a way to infer how neural network outputs change with hyperparameters, building surrogate models that adapt to new settings.
Neural networks (NNs) often have critical behavioural trade-offs that are set at design time with hyperparameters-such as reward weights in reinforcement learning or quantile targets in regression. Post-deployment, however, user preferences can evolve, making initial settings undesirable, necessitating potentially expensive retraining. To circumvent this, we introduce the task of Hyperparameter Trajectory Inference (HTI): to learn, from observed data, how a NN's conditional output distribution changes with its hyperparameters, and construct a surrogate model that approximates the NN at unobserved hyperparameter settings. HTI requires extending existing trajectory inference approaches to incorporate conditions, exacerbating the challenge of ensuring inferred paths are feasible. We propose an approach based on conditional Lagrangian optimal transport, jointly learning the Lagrangian function governing hyperparameter-induced dynamics along with the associated optimal transport maps and geodesics between observed marginals, which form the surrogate model. We incorporate inductive biases based on the manifold hypothesis and least-action principles into the learned Lagrangian, improving surrogate model feasibility. We empirically demonstrate that our approach reconstructs NN outputs across various hyperparameter spectra better than other alternatives.