Bayesian policy gradients slash sample complexity in RL by modeling the policy gradient as a Gaussian process, offering faster convergence and gradient uncertainty estimates.