Q-value policies, traditionally outperformed by state-value policies in planning, can surpass them with the right regularization, offering a faster alternative for policy evaluation.