This paper introduces a 3D counting approach for stacked objects in industrial settings, targeting the heavy occlusion and irregular stacking that hinder existing methods. The method decomposes the counting task into two subproblems solved from multi-view images: estimating the 3D geometry of the stack and estimating its occupancy ratio. Experiments on synthetic and real-world datasets show that the method counts parts accurately even under significant occlusion and irregular stacking.
Accurately counting stacked objects in industrial settings, even with heavy occlusion, is now possible thanks to a novel 3D geometry and occupancy ratio estimation approach.
Visual object counting is a fundamental computer vision task in industrial inspection, where accurate, high-throughput inventory tracking and quality assurance are critical. Moreover, manufactured parts are often too light for their count to be reliably deduced from their weight, or too heavy for the stack to be moved onto a scale safely and practically, making automated visual counting the more robust solution in many scenarios. However, existing methods struggle with stacked 3D items in containers, pallets, or bins, where most objects are heavily occluded and only a few are directly visible. To address this important yet underexplored challenge, we propose a novel 3D counting approach that decomposes the task into two complementary subproblems: estimating the 3D geometry of the stack and estimating its occupancy ratio from multi-view images. By combining geometric reconstruction with deep learning-based depth analysis, our method can accurately count identical manufactured parts inside containers, even when they are irregularly stacked and partially hidden. We validate our 3D counting pipeline on large-scale synthetic and diverse real-world data with manually verified total counts, demonstrating robust performance under realistic inspection conditions.
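To make the decomposition concrete, the sketch below shows one plausible way the two estimates could be combined into a count. The abstract does not give the combination rule, so the formula here (stack volume times occupancy ratio, divided by the volume of a single part) is an assumption, and the function name `estimate_count`, its parameters, and the numbers in the example are all hypothetical.

```python
def estimate_count(stack_volume_cm3: float,
                   occupancy_ratio: float,
                   part_volume_cm3: float) -> int:
    """Illustrative count estimate; not the paper's confirmed formula.

    stack_volume_cm3: volume enclosed by the reconstructed stack geometry.
    occupancy_ratio:  fraction of that volume filled by parts, in [0, 1].
    part_volume_cm3:  volume of one part (assumed known, e.g. from a CAD model).
    """
    if stack_volume_cm3 < 0:
        raise ValueError("stack volume must be non-negative")
    if not 0.0 <= occupancy_ratio <= 1.0:
        raise ValueError("occupancy ratio must lie in [0, 1]")
    if part_volume_cm3 <= 0:
        raise ValueError("part volume must be positive")

    # Volume actually filled by parts, excluding the gaps between them.
    occupied_volume = stack_volume_cm3 * occupancy_ratio
    # Identical parts: divide by the unit volume and round to a whole count.
    return round(occupied_volume / part_volume_cm3)


# Hypothetical numbers: a 12,000 cm^3 stack that is 40% filled with 16 cm^3 parts.
print(estimate_count(12000.0, 0.4, 16.0))
```

Under this reading, the occluded parts never need to be seen individually: an error in either the reconstructed volume or the occupancy ratio scales the count proportionally, which is why both subproblems matter.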