Microsoft Research | INRIA | Paris-Saclay | Apr 15, 2026 | arXiv:2604.13739

Spectral Thompson sampling

Tomas Kocak, Michal Valko, Remi Munos, Shipra Agrawal

AI Summary

This paper introduces Spectral Thompson Sampling (SpectralTS), an algorithm for bandit problems in which payoffs are smooth over an underlying graph, a setting relevant to recommender systems and advertising where traditional algorithms scale poorly with the number of choices. The analysis relies on an effective dimension *d*, which is small in real-world graphs, and shows that the regret of SpectralTS scales as *d*√(*T* ln *N*) with high probability, where *T* is the time horizon and *N* is the number of choices. This regret is comparable to known results, while SpectralTS is computationally more efficient. Experiments on synthetic and real-world data show that the algorithm is competitive.

Key Contribution

Spectral Thompson Sampling offers a computationally tractable alternative for bandit problems on graphs, achieving comparable regret bounds to existing methods while scaling efficiently to large action spaces.

Abstract

Thompson Sampling (TS) has attracted a lot of interest due to its good empirical performance, particularly in computational advertising. Though it has been successful, the tools for analyzing its performance appeared only recently. In this paper, we describe and analyze the SpectralTS algorithm for a bandit problem in which the payoffs of the choices are smooth given an underlying graph. In this setting, each choice is a node of a graph and the expected payoffs of neighboring nodes are assumed to be similar. Although the setting has applications in both recommender systems and advertising, traditional algorithms would scale poorly with the number of choices. For that purpose we consider an effective dimension *d*, which is small in real-world graphs. We deliver an analysis showing that the regret of SpectralTS scales as *d*√(*T* ln *N*) with high probability, where *T* is the time horizon and *N* is the number of choices. Since a *d*√(*T* ln *N*) regret is comparable to the known results, SpectralTS offers a computationally more efficient alternative. We also show that our algorithm is competitive on both synthetic and real-world data.
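The abstract does not spell out the algorithm, so the following is only a minimal sketch of how Thompson sampling on a graph might look, under assumptions that are not stated on this page: the arms are taken to be the rows of the eigenvector matrix of the graph Laplacian, the prior precision is the eigenvalue matrix plus a regularizer `lam`, and the effective dimension is taken to be the largest *d* with (*d* − 1)λ_d ≤ *T* / ln(1 + *T*/λ). The function names, the exploration scale `v`, the noise level, and the default values are all hypothetical and chosen for illustration.

```python
import numpy as np


def effective_dimension(eigvals, T, lam):
    """Largest d (1-indexed) with (d - 1) * eigvals[d - 1] <= T / ln(1 + T / lam).

    Assumes eigvals are sorted in ascending order (assumed definition of d).
    """
    bound = T / np.log(1.0 + T / lam)
    d = 1
    for i, ev in enumerate(eigvals, start=1):
        if (i - 1) * ev <= bound:
            d = i
    return d


def spectral_ts(W, true_alpha, T, lam=0.01, v=0.1, noise=0.01, seed=0):
    """Run a SpectralTS-style loop on a graph with symmetric weight matrix W.

    true_alpha holds the (unknown to the learner) payoff coefficients in the
    Laplacian eigenbasis; it is used only to simulate rewards and regret.
    Returns the cumulative pseudo-regret after T rounds.
    """
    rng = np.random.default_rng(seed)
    N = W.shape[0]

    # Graph Laplacian and its eigendecomposition L = Q diag(eigvals) Q^T.
    L = np.diag(W.sum(axis=1)) - W
    eigvals, Q = np.linalg.eigh(L)

    # Row i of Q is the feature vector of node (arm) i.
    means = Q @ true_alpha
    best = means.max()

    # Regularized precision matrix: smooth directions are penalized less.
    B = np.diag(eigvals + lam)
    f = np.zeros(N)
    mu_hat = np.zeros(N)
    regret = 0.0

    for _ in range(T):
        # Sample mu_tilde ~ N(mu_hat, v^2 B^{-1}) via a Cholesky factor of B.
        C = np.linalg.cholesky(B)            # B = C C^T
        z = rng.standard_normal(N)
        mu_tilde = mu_hat + v * np.linalg.solve(C.T, z)

        # Play the arm that looks best under the sampled parameter.
        a = int(np.argmax(Q @ mu_tilde))
        b = Q[a]
        r = means[a] + noise * rng.standard_normal()

        # Regularized least-squares update of the estimate.
        f += r * b
        B += np.outer(b, b)
        mu_hat = np.linalg.solve(B, f)

        regret += best - means[a]

    return regret
```

Calling `spectral_ts(W, alpha, T=200)` on a small path graph with `alpha` concentrated on the low-frequency eigenvectors gives a quick way to experiment with the smoothness assumption. The Cholesky-based sampling is a design choice of this sketch, used to avoid forming the inverse of `B` explicitly.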

Recommendation & Information Retrieval

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
