Search papers, labs, and topics across Lattice.
This paper introduces a sample-based Penalized Predictive Control (PPC) framework for safety-critical contextual control using black-box simulators. The core idea is to learn a score-based density model of feasible actions conditioned on context, which induces a Riemannian geometry on the action space. The authors derive a contextual safety bound showing that the distance from the true feasibility manifold is controlled by score estimation error and the barrier curvature, demonstrating improved performance over baseline density models in a dynamic navigation task, especially after environment shifts.
Safety guarantees for black-box control systems can be derived from the geometry of learned feasibility constraints, not just unknown dynamics.
Modern world models are becoming too complex to admit explicit dynamical descriptions. We study safety-critical contextual control, where a Planner must optimize a task objective using only feasibility samples from a black-box Simulator, conditioned on a context signal $ξ_t$. We develop a sample-based Penalized Predictive Control (PPC) framework grounded in online Riemannian optimization, in which the Simulator compresses the feasibility manifold into a score-based density $\hat{p}(u \mid ξ_t)$ that endows the action space with a Riemannian geometry guiding the Planner's gradient descent. The barrier curvature $κ(ξ_t)$, the minimum curvature of the conditional log-density $-\ln\hat{p}(\cdot\midξ_t)$, governs both convergence rate and safety margin, replacing the Lipschitz constant of the unknown dynamics. Our main result is a contextual safety bound showing that the distance from the true feasibility manifold is controlled by the score estimation error and a ratio that depends on $κ(ξ_t)$, both of which improve with richer context. Simulations on a dynamic navigation task confirm that contextual PPC substantially outperforms marginal and frozen density models, with the advantage growing after environment shifts.