Semiparametric bandits can achieve $\tilde{O}(\sqrt{T})$ regret while retaining interpretability, thanks to a novel kernelized ε-greedy algorithm and Stein-based estimation.