Search papers, labs, and topics across Lattice.
This paper introduces $位$-Reachability, a novel approach to Hamilton-Jacobi safety analysis that enhances safety evaluations for high-dimensional robotic systems. By employing a stochastic multi-step estimator with a geometrically distributed rollout horizon, it interpolates between local updates and long-horizon safety targets, allowing for more flexible and scalable safety assessments. Experimental results show that $位$-Reachability outperforms traditional single-step temporal-difference methods in both safe-set boundary classification and safety margin estimation for humanoid robots.
$位$-Reachability improves safety analysis in robotics by significantly enhancing the accuracy of safety margin estimations and safe-set classifications.
We introduce $位$-Reachability, a scalable approach to Hamilton--Jacobi safety analysis for high-dimensional robotic systems. Unlike prior discounted formulations that rely on fixed one-step Bellman updates, $位$-Reachability employs a stochastic multi-step estimator of the safety value, using a geometrically distributed rollout horizon together with a randomly absorbed terminal. Conceptually analogous to TD($位$), $位$-Reachability interpolates between local self-consistency updates and long-horizon max-over-trajectory safety targets via an interpretable horizon-control parameter. Unlike TD($位$), where the terminal value is always incorporated in learning targets, the terminal safety value in $位$-Reachability is only used at a probability controlled by parameter $未$. We formally show that for $未<1$, the update induces a contraction mapping that allows temporal-difference learning; as $位\to 1$, the estimator recovers the undiscounted reachability objective. We apply $位$-Reachability to high-dimensional safety learning problems with both simulated and real humanoid robots under balance and collision avoidance constraints. Experimental results demonstrate that $位$-Reachability significantly improves both safe-set boundary classification and safety margin estimation compared to single-step temporal-difference baselines.