Search papers, labs, and topics across Lattice.
Shanghai Jiao Tong University, The University of Texas at Austin, The Chinese University of Hong Kong
1
0
2
Entropy-regularized RL, though non-convex in the traditional sense, exhibits a Bellman-induced Polyak--艁ojasiewicz geometry that guarantees global convergence of Wasserstein Policy Gradient.