Search papers, labs, and topics across Lattice.
Stanford
1
7
2
12
Q-functions and implicit policy extraction are game-changers for batch online RL in robotics, unlocking significant performance gains over imitation-based approaches.