Semiparametric bandits can achieve $\tilde{O}(\sqrt{T})$ regret while retaining interpretability, thanks to a novel kernelized $\epsilon$-greedy algorithm and Stein-based estimation.