Mitchell Black

MIT Lincoln Laboratory {oswinso,eyyu}@mit.edu These authors contributed equally to this work. Abstract Recent advances in deep reinforcement learning (RL) have achieved strong results on high-dimensional control tasks, but applying RL to optimal safe controller synthesis raises a fundamental mismatch: optimal safe controller synthesis seeks to maximize the set of states from which a system remains safe indefinitely, while RL optimizes expected returns over a user-specified distribution. This mismatch can yield policies that perform poorly on low-probability states still within the safe set. A natural alternative is to frame the problem as a robust optimization over a set of initial conditions that specify the initial state, dynamics and safe set, but whether this problem has a solution depends on the feasibility of the specified set, which is unknown a priori. We propose Feasibility-Guided Exploration (FGE), a method that simultaneously identifies a subset of feasible initial conditions under which a safe policy exists, and learns a policy to solve the optimal control problem over this set of initial conditions. Empirical results demonstrate that FGE learns policies with over 50%50\% more coverage than the best existing method for challenging initial conditions across tasks in the MuJoCo simulator. The project page can be found at https://oswinso.xyz/fge.

MIT CSAIL

Papers on Lattice

Total citations

Topics

Research focus

Robotics & Embodied AI (1)World Models & Planning (1)

Frequent co-authors

Oswin So (1)Eric Yang Yu (1)Songyuan Zhang (1)Matthew Cleaveland (1)

Papers (1)

Feb 17, 2026

MIT CSAILFeb 17, 2026

Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning

Forget hand-engineering initial conditions for robust RL: this method *learns* which conditions are feasible while simultaneously training a safe policy.

Oswin So, Eric Yang Yu, Songyuan Zhang +3

Robotics & Embodied AI World Models & Planning

Search

Mitchell Black

Research focus

Frequent co-authors

Papers (1)