Search papers, labs, and topics across Lattice.
This paper introduces Monte Carlo Pass Search (MCPS), a novel approach to evaluate football passes by framing the problem within a Monte Carlo Tree Search (MCTS) framework. Utilizing the first public high-fidelity tracking dataset with 3D ball trajectories from the Bundesliga, MCPS generates and evaluates counterfactual pass trajectories by sampling execution variants and scoring outcomes with a learned value model. The method achieves strong forecasting accuracy and enables distribution-aware attribution through complementary execution-surplus scores, enhancing the analysis and ranking of pass effectiveness.
MCPS reveals that analyzing football passes through a Monte Carlo framework can significantly improve the understanding of pass effectiveness and decision-making in real-time scenarios.
We recast pass evaluation in football (soccer) as a Monte Carlo Tree Search (MCTS)-like evaluation problem whose components mostly exist in the literature under different names: a value model (possession value), a world model (multi-agent trajectories with ball interactions), and a policy over counterfactual actions (sampling pass variants with noise). Building on the first public high-fidelity tracking dataset with 3D ball trajectories from the Bundesliga, we introduce Monte Carlo Pass Search (MCPS), which infers kick parameters for each observed pass, samples execution variants and option variants, rolls each candidate forward with a ball-conditioned world model until the next ball interaction, and scores outcomes with a learned value model to obtain a distribution over gained value. This distribution enables distribution-aware attribution with two complementary execution-surplus scores used for analysis and ranking: mean-based and percentile-based scores. To make the world model sample-efficient under limited public data, we adapt a discrete-token, autoregressive trajectory generator from autonomous driving (SMART) and show it yields strong best-of-20 forecasting accuracy compared to baselines, while supporting fully hypothetical rollouts for downstream evaluation. We have released model checkpoints and code.