This paper introduces a lightweight depth-aware distillation framework that adds geometric cues to a DINOv2-based visual place recognition model, targeting the repetitive vegetation and appearance variation typical of forests. On the WildCross benchmark, the approach outperforms an appearance-only counterpart, indicating that depth information improves recognition robustness. The results support depth as a complementary modality for place recognition in complex natural environments.
Depth-aware distillation improves visual place recognition in forests, showing that geometric cues can make a model more robust to appearance variations.
Visual place recognition in natural forest environments remains challenging due to repetitive vegetation, weak structural cues, and significant appearance variation across traversals. To address these challenges, this paper proposes a lightweight depth-aware distillation framework that injects geometric cues into a DINOv2-based place recognition model while preserving its pre-trained descriptor space. Evaluated on the recent WildCross benchmark, the proposed approach yields gains over an appearance-only counterpart and improves robustness to appearance variations. These results demonstrate the importance of depth as a strong complementary modality for place recognition in natural environments and identify depth-aware distillation as a promising direction for more robust forest perception.
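The abstract does not state the training objective, so the following is only a minimal sketch of what a depth-aware distillation loss that also preserves a pre-trained descriptor space might look like. Everything here is an assumption made for illustration: the function names, the use of cosine distance, the weighting parameter `alpha`, and the premise that the student, the depth teacher, and the frozen appearance model all produce descriptors of the same dimension. It is not the paper's method.

```python
import math


def l2_normalize(v):
    """Scale a descriptor to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two descriptors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return 1.0 - sum(x * y for x, y in zip(a, b))


def distillation_loss(student, depth_teacher, frozen_rgb, alpha=0.5):
    """Hypothetical objective (an assumption, not taken from the paper).

    depth_term pulls the student descriptor toward a depth-derived
    teacher descriptor; preserve_term keeps it close to the frozen
    pre-trained appearance descriptor. alpha trades one off against
    the other.
    """
    depth_term = cosine_distance(student, depth_teacher)
    preserve_term = cosine_distance(student, frozen_rgb)
    return alpha * depth_term + (1.0 - alpha) * preserve_term
```

In this sketch, setting `alpha` to 0 reduces the loss to pure preservation of the pre-trained descriptor, while setting it to 1 ignores the pre-trained space entirely; intermediate values express the trade-off the abstract describes in words.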