CMU MLNorthwesternJun 14, 2026arXiv:2606.16022

$λ$-Reachability: Geometric-Horizon Safety Bellman Equations for Humanoid Safety

Rui Chen, Shangtao Li, Yifan Sun, Changliu Liu

AI Summary

This paper introduces $λ$-Reachability, a novel approach to Hamilton-Jacobi safety analysis that enhances safety evaluations for high-dimensional robotic systems. By employing a stochastic multi-step estimator with a geometrically distributed rollout horizon, it interpolates between local updates and long-horizon safety targets, allowing for more flexible and scalable safety assessments. Experimental results show that $λ$-Reachability outperforms traditional single-step temporal-difference methods in both safe-set boundary classification and safety margin estimation for humanoid robots.

Key Contribution

$λ$-Reachability improves safety analysis in robotics by significantly enhancing the accuracy of safety margin estimations and safe-set classifications.

Abstract

We introduce $λ$-Reachability, a scalable approach to Hamilton--Jacobi safety analysis for high-dimensional robotic systems. Unlike prior discounted formulations that rely on fixed one-step Bellman updates, $λ$-Reachability employs a stochastic multi-step estimator of the safety value, using a geometrically distributed rollout horizon together with a randomly absorbed terminal. Conceptually analogous to TD($λ$), $λ$-Reachability interpolates between local self-consistency updates and long-horizon max-over-trajectory safety targets via an interpretable horizon-control parameter. Unlike TD($λ$), where the terminal value is always incorporated in learning targets, the terminal safety value in $λ$-Reachability is only used at a probability controlled by parameter $δ$. We formally show that for $δ<1$, the update induces a contraction mapping that allows temporal-difference learning; as $λ\to 1$, the estimator recovers the undiscounted reachability objective. We apply $λ$-Reachability to high-dimensional safety learning problems with both simulated and real humanoid robots under balance and collision avoidance constraints. Experimental results demonstrate that $λ$-Reachability significantly improves both safe-set boundary classification and safety margin estimation compared to single-step temporal-difference baselines.

Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

$λ$-Reachability: Geometric-Horizon Safety Bellman Equations for Humanoid Safety

Related Papers