This paper presents a solution to the 2025 SoccerNet Monocular Depth Estimation Competition, where the task is to predict relative depth in football scenes from only a few thousand training samples. The authors build on the zero-shot capabilities of models pretrained on large-scale datasets, training them to predict metric depth and using that prediction to obtain relative depth. The approach achieves a score of $2.68 \times 10^{-3}$ on the challenge set.
A score of $2.68 \times 10^{-3}$ on the challenge set suggests that depth models pretrained on large-scale datasets transfer well to football scenes, even when task-specific training data is limited.
We present our solution to the 2025 SoccerNet Monocular Depth Estimation Competition Challenge. Predicting the relative depth in football scenarios is challenging, especially with only thousands of training samples available. To address this issue, our method leverages the powerful zero-shot capabilities of models pretrained on large-scale datasets to learn metric depth for effective relative depth prediction, achieving a score of $2.68 \times 10^{-3}$ on the challenge set.
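To illustrate why a metric depth prediction is sufficient for relative depth, the following minimal sketch converts a metric depth map into a relative one by min-max normalization. This is an assumption made for illustration, not the authors' pipeline: the abstract does not state how metric depth is mapped to relative depth, and the function name `metric_to_relative` and the toy values are hypothetical.

```python
from typing import List


def metric_to_relative(depth_m: List[List[float]]) -> List[List[float]]:
    """Min-max normalize a metric depth map (in meters) to relative depth in [0, 1].

    Hypothetical helper for illustration only; the source does not
    specify the normalization used in the competition entry.
    """
    flat = [d for row in depth_m for d in row]
    d_min, d_max = min(flat), max(flat)
    span = d_max - d_min
    if span == 0:
        # A constant depth map carries no ordering information.
        return [[0.0 for _ in row] for row in depth_m]
    return [[(d - d_min) / span for d in row] for row in depth_m]


# Toy 2x2 depth map in meters (made-up values).
depth_m = [[10.0, 20.0], [30.0, 50.0]]
relative = metric_to_relative(depth_m)

# Rescaling the metric prediction leaves the relative depth unchanged.
scaled = [[2.0 * d for d in row] for row in depth_m]
relative_scaled = metric_to_relative(scaled)
```

Under this normalization, a global scale error in the metric prediction does not affect the relative depth map, which is one plausible reason a metric-depth model can serve a relative-depth benchmark.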