Mar 3, 2026arXiv:2603.03577

From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes

Qifan Zhang, Sai Haneesh Allu, Jikai Wang, Yangxiao Lu

AI Summary

L2G-Det, a novel instance detection framework, addresses the challenge of detecting and segmenting novel object instances in open-world environments using only a few template images. It bypasses explicit object proposals by employing dense patch-level matching between templates and the query image to generate candidate points, which are then refined. These filtered points prompt an augmented Segment Anything Model (SAM) with instance-specific object tokens, leading to improved instance mask reconstruction compared to proposal-based methods.

Key Contribution

Ditch unreliable object proposals: L2G-Det uses local patch matching to guide SAM for robust instance segmentation in cluttered, open-world scenes.

Abstract

Detecting and segmenting novel object instances in open-world environments is a fundamental problem in robotic perception. Given only a small set of template images, a robot must locate and segment a specific object instance in a cluttered, previously unseen scene. Existing proposal-based approaches are highly sensitive to proposal quality and often fail under occlusion and background clutter. We propose L2G-Det, a local-to-global instance detection framework that bypasses explicit object proposals by leveraging dense patch-level matching between templates and the query image. Locally matched patches generate candidate points, which are refined through a candidate selection module to suppress false positives. The filtered points are then used to prompt an augmented Segment Anything Model (SAM) with instance-specific object tokens, enabling reliable reconstruction of complete instance masks. Experiments demonstrate improved performance over proposal-based methods in challenging open-world settings.

Computer Vision Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes

Related Papers