Search papers, labs, and topics across Lattice.
1
0
2
Forget hand-engineering initial conditions for robust RL: this method *learns* which conditions are feasible while simultaneously training a safe policy.