Apr 21, 2026 · arXiv:2604.19639

Safety-Critical Contextual Control via Online Riemannian Optimization with World Models

AI Summary

This paper introduces a sample-based Penalized Predictive Control (PPC) framework for safety-critical contextual control using black-box simulators. The core idea is to learn a score-based density model of feasible actions conditioned on context, which induces a Riemannian geometry on the action space. The authors derive a contextual safety bound showing that the distance from the true feasibility manifold is controlled by the score estimation error and the barrier curvature. Simulations on a dynamic navigation task show that contextual PPC outperforms marginal and frozen density baselines, especially after environment shifts.
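As a worked illustration that is not taken from the paper, suppose the learned density of feasible actions $u$ given context $ξ_t$ were Gaussian with a hypothetical mean $\mu(ξ_t)$ and covariance $\Sigma(ξ_t)$. Under that assumption the barrier $-\ln\hat{p}$, its score, and the barrier curvature $κ(ξ_t)$ (the minimum curvature of the barrier) have closed forms:

```latex
\begin{align*}
-\ln \hat{p}(u \mid \xi_t)
  &= \tfrac{1}{2}\,\bigl(u - \mu(\xi_t)\bigr)^{\top} \Sigma(\xi_t)^{-1} \bigl(u - \mu(\xi_t)\bigr)
     + \text{const (independent of } u\text{)}, \\
\nabla_u \ln \hat{p}(u \mid \xi_t)
  &= -\,\Sigma(\xi_t)^{-1} \bigl(u - \mu(\xi_t)\bigr), \\
\nabla_u^{2} \bigl(-\ln \hat{p}(u \mid \xi_t)\bigr)
  &= \Sigma(\xi_t)^{-1}, \\
\kappa(\xi_t)
  &= \lambda_{\min}\!\bigl(\Sigma(\xi_t)^{-1}\bigr)
   = \frac{1}{\lambda_{\max}\bigl(\Sigma(\xi_t)\bigr)}.
\end{align*}
```

In this toy case, a context that concentrates the feasible set (smaller $\lambda_{\max}(\Sigma)$) gives a larger curvature. The paper's learned score model need not be Gaussian.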

Key Contribution

Safety guarantees for black-box control systems can be derived from the geometry of learned feasibility constraints, with the barrier curvature taking the place of the Lipschitz constant of the unknown dynamics.

Abstract

Modern world models are becoming too complex to admit explicit dynamical descriptions. We study safety-critical contextual control, where a Planner must optimize a task objective using only feasibility samples from a black-box Simulator, conditioned on a context signal $ξ_t$. We develop a sample-based Penalized Predictive Control (PPC) framework grounded in online Riemannian optimization, in which the Simulator compresses the feasibility manifold into a score-based density $\hat{p}(u \mid ξ_t)$ that endows the action space with a Riemannian geometry guiding the Planner's gradient descent. The barrier curvature $κ(ξ_t)$, the minimum curvature of the conditional log-density $-\ln\hat{p}(\cdot \mid ξ_t)$, governs both convergence rate and safety margin, replacing the Lipschitz constant of the unknown dynamics. Our main result is a contextual safety bound showing that the distance from the true feasibility manifold is controlled by the score estimation error and a ratio that depends on $κ(ξ_t)$, both of which improve with richer context. Simulations on a dynamic navigation task confirm that contextual PPC substantially outperforms marginal and frozen density models, with the advantage growing after environment shifts.
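The planner loop described above can be sketched on a toy problem. This is a minimal illustration under stated assumptions, not the authors' implementation: the learned density is replaced by a hypothetical Gaussian stand-in (`toy_density`), the task objective is a hypothetical quadratic cost, the Hessian of the barrier is used as the metric for the preconditioned step, and the names `score`, `barrier_curvature`, `ppc_step`, `plan`, `lam`, and `eta` are all invented for this example.

```python
import numpy as np

def toy_density(xi):
    """Hypothetical stand-in for the learned p_hat(u | xi): a Gaussian whose
    mean tracks the context. Returns (mean, precision matrix)."""
    mu = np.asarray(xi, dtype=float)
    cov = np.diag([0.25, 1.0])  # assumed: tighter feasibility along axis 0
    return mu, np.linalg.inv(cov)

def score(u, mu, prec):
    """grad_u ln p_hat(u | xi) for the Gaussian stand-in."""
    return -prec @ (u - mu)

def barrier_curvature(prec):
    """Smallest eigenvalue of the Hessian of -ln p_hat (equal to prec here)."""
    return float(np.linalg.eigvalsh(prec)[0])

def ppc_step(u, target, mu, prec, lam=1.0, eta=0.5):
    """One descent step on 0.5*||u - target||^2 - lam * ln p_hat(u | xi),
    preconditioned by the barrier Hessian (assumed metric for this sketch)."""
    grad = (u - target) - lam * score(u, mu, prec)
    return u - eta * np.linalg.solve(prec, grad)

def plan(xi, target, steps=50):
    """Run the penalized descent from the density mean for a fixed context."""
    mu, prec = toy_density(xi)
    u = mu.copy()
    for _ in range(steps):
        u = ppc_step(u, target, mu, prec)
    return u, barrier_curvature(prec)

if __name__ == "__main__":
    u, kappa = plan([0.0, 0.0], np.array([2.0, 2.0]))
    print(u, kappa)
```

With context `[0, 0]` and target `[2, 2]`, the iterate settles near `[0.4, 1.0]` and the toy curvature is `1.0`: the planner moves toward the target but is held back more along the first axis, where the stand-in density is tighter.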

Robotics & Embodied AI · Training Efficiency & Optimization · World Models & Planning

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
