Search papers, labs, and topics across Lattice.
This paper introduces a learning-based hierarchical scene graph matching pipeline for robot localization, leveraging prior maps from Building Information Models (BIM). The method augments scene graphs with semantically motivated edge types to encode intra- and inter-level relationships, enabling hierarchical matching from high-level room concepts to low-level wall surfaces. Trained on floor plans, the approach achieves higher F1 scores than combinatorial baselines on real LiDAR environments while being an order of magnitude faster, demonstrating zero-shot generalization for BIM-assisted robot localization.
Hierarchical scene graph matching, learned end-to-end, unlocks fast and accurate robot localization by grounding real-time sensor data against prior architectural maps.
Accurate localization is a fundamental requirement for autonomous robots operating in indoor environments. Scene graphs encode the spatial structure of an environment as a hierarchy of semantic entities and their relationships, and can be constructed both online from robot sensor data and offline from architectural priors such as Building Information Models (BIM). Matching these two complementary representations enables drift correction in SLAM by grounding robot observations against a known structural prior. However, establishing reliable node-to-node correspondences between them remains an open challenge: existing combinatorial methods are prohibitively expensive at scale, and prior learned approaches address only flat graph matching, ignoring the multi-level semantic structure present in both representations. Here we present a learned, end-to-end differentiable pipeline that augments both graphs with semantically motivated edge types encoding intra- and inter- level relationships, explicitly exploiting this hierarchy to enable simultaneous matching from high-level room concepts down to low-level wall surfaces. Trained exclusively on floor plans, the proposed method outperforms the combinatorial baseline in F1 on real LiDAR environments while running an order of magnitude faster, demonstrating viable zero-shot generalization for BIM-assisted robot localization.