Search papers, labs, and topics across Lattice.
1
0
2
Exponentially many policies in Tree MDPs don't have to mean exponential computation: clever confidence bounds let you treat policy selection as a tractable bandit problem.