Offline RL gets a boost: IPD uses a world model and model predictive control (MPC) to hallucinate optimal trajectories, then distills them into a Transformer policy for substantial performance gains.