Feb 17, 2026 · arXiv:2602.15473

POP: Prior-fitted Optimizer Policies

Jan Kobiolka, Christian Frey, Gresa Shala, Arlind Kadra, Erind Bedalli, Josif Grabocka

AI Summary

The paper introduces Prior-fitted Optimizer Policies (POP), a meta-learned optimizer that predicts coordinate-wise step sizes based on optimization trajectory context. POP is trained on a novel synthetic optimization problem prior encompassing both convex and non-convex objectives. Empirical results on a benchmark of 47 functions demonstrate that POP outperforms first-order methods, evolutionary strategies, Bayesian optimization, and other meta-learned optimizers without task-specific tuning.

Key Contribution

POP, a meta-learned optimizer, outperforms both gradient-based and black-box optimization methods on a diverse 47-function benchmark, without task-specific tuning.

Abstract

Optimization refers to the task of finding extrema of an objective function. Classical gradient-based optimizers are highly sensitive to hyperparameter choices. In highly non-convex settings their performance relies on carefully tuned learning rates, momentum, and gradient accumulation. To address these limitations, we introduce POP (Prior-fitted Optimizer Policies), a meta-learned optimizer that predicts coordinate-wise step sizes conditioned on the contextual information provided in the optimization trajectory. Our model is learned on millions of synthetic optimization problems sampled from a novel prior spanning both convex and non-convex objectives. We evaluate POP on an established benchmark including 47 optimization functions of various complexity, where it consistently outperforms first-order gradient-based methods, non-convex optimization approaches (e.g., evolutionary strategies), Bayesian optimization, and a recent meta-learned competitor under matched budget constraints. Our evaluation demonstrates strong generalization capabilities without task-specific tuning.
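To make the idea of coordinate-wise step sizes concrete, the following is a minimal sketch of an update loop in which a policy proposes one step size per coordinate from a small piece of trajectory context (the previous and current gradients). It is not the authors' model: `placeholder_policy` is a hypothetical, hand-written Rprop-style sign rule standing in for the learned policy, and all names, constants, and the test function are assumptions chosen for illustration.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def placeholder_policy(prev_steps: Sequence[float],
                       prev_grad: Sequence[float],
                       grad: Sequence[float]) -> Vector:
    """Hypothetical stand-in for a learned policy (NOT the POP model).

    Uses an Rprop-style heuristic: grow a coordinate's step size while the
    gradient sign is stable, shrink it when the sign flips.
    """
    steps = []
    for s, g_old, g_new in zip(prev_steps, prev_grad, grad):
        if g_old * g_new > 0:      # same sign: take a larger step
            s = min(s * 1.2, 1.0)
        elif g_old * g_new < 0:    # sign flip: overshoot, take a smaller step
            s = max(s * 0.5, 1e-6)
        steps.append(s)
    return steps


def optimize(grad_fn: Callable[[Vector], Vector],
             x0: Sequence[float],
             n_iters: int = 100,
             init_step: float = 0.01) -> Vector:
    """Gradient descent with a separate step size for every coordinate."""
    x = list(x0)
    steps = [init_step] * len(x)
    prev_grad = [0.0] * len(x)
    for _ in range(n_iters):
        grad = grad_fn(x)
        # The policy sees trajectory context and returns per-coordinate steps.
        steps = placeholder_policy(steps, prev_grad, grad)
        x = [xi - si * gi for xi, si, gi in zip(x, steps, grad)]
        prev_grad = grad
    return x


def quadratic_grad(x: Vector) -> Vector:
    """Gradient of the ill-conditioned toy objective f(x) = x0^2 + 100 * x1^2."""
    return [2.0 * x[0], 200.0 * x[1]]


result = optimize(quadratic_grad, [3.0, -2.0])
print(result)
```

The toy objective has very different curvature along its two coordinates, which is the kind of setting where a single global learning rate is hard to tune and per-coordinate step sizes can help. In POP, according to the abstract, the step sizes are predicted by a model meta-trained on synthetic problems rather than by a fixed rule like the one above.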

Architecture Design (Transformers, SSMs, MoE) · Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
