Search papers, labs, and topics across Lattice.
Google DeepMind, Microsoft Research Work done during the internship at Microsoft Research.
Google DeepMind1
15
3
4
Ditch the high-fidelity simulator: IRL-VLA uses a lightweight reward world model trained with inverse reinforcement learning to enable efficient and effective closed-loop RL training for autonomous driving.