Instead of just preventing harmful actions by LM agents, we can now steer them back from the brink using human-aligned recovery plans, significantly improving safety after a mistake.
Users prefer robots that learn their preferences using CMA-ES-IG because it suggests more perceptually distinct and informative behaviors to rank.