Search papers, labs, and topics across Lattice.
1
0
2
Forget hand-engineered reward functions: this method learns complex exploratory behaviors by simply predicting which states lead to unpredictable futures.