By modeling policy gradients as Gaussian processes, this work reduces sample complexity in reinforcement learning, yielding faster convergence and uncertainty estimates at little extra cost.