Adobe ResearchINRIAParis-SaclayApr 28, 2026arXiv:2604.25272

Spectral bandits

Tomás Kocák, R. Munos, B. Kveton, Shipra Agrawal, Michal Valko, P. Auer

AI Summary

This paper introduces a spectral bandit framework for online learning on graphs, where arm payoffs are smooth with respect to graph structure. They define an "effective dimension" that captures the intrinsic complexity of real-world graphs and propose three algorithms with regret scaling linearly or sublinearly in this dimension. Experiments on content recommendation demonstrate effective learning of user preferences from limited node evaluations, even with thousands of items.

Key Contribution

Learn user preferences across thousands of items from just tens of node evaluations by exploiting graph smoothness in a new spectral bandit framework.

Abstract

Smooth functions on graphs have wide applications in manifold and semi-supervised learning. In this work, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learning problems that involve graphs, such as content-based recommendation. In this problem, each item we can recommend is a node of an undirected graph and its expected rating is similar to the one of its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret with respect to the optimal policy would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose three algorithms for solving our problem that scale linearly and sublinearly in this dimension. Our experiments on content recommendation problem show that a good estimator of user preferences for thousands of items can be learned from just tens of node evaluations.

Recommendation & Information Retrieval

Citation Metrics

Citations12

Influential citations2

References60

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Spectral bandits

Related Papers