Univ. of Southern California
Learning robotic reward functions from a million trajectories reveals that comparing entire trajectories, not just individual frames, unlocks better generalization and learning from suboptimal data.