LLM-controlled robots can be made significantly safer by filtering unsafe natural language commands *before* they're executed, preventing downstream errors.
PRIMT tackles the data inefficiency of preference-based RL by using foundation models to generate synthetic multimodal feedback and synthesize trajectories, significantly outperforming existing FM-based and scripted baselines.