Search papers, labs, and topics across Lattice.
This paper addresses the problem of person detection and tracking in industrial environments using LiDAR data acquired from an overhead crane, a domain with a significant shift from typical vehicle-centric datasets. To overcome the lack of suitable training data, the authors curated a new overhead LiDAR dataset with 3D human bounding box annotations. They then adapted and evaluated several 3D object detectors, finding that VoxelNeXt and SECOND backbones achieved the best performance, reaching an AP of 0.84 within a 5m radius.
Overhead crane LiDAR struggles with person detection, but VoxelNeXt and SECOND backbones can achieve 84% AP within 5m, closing the gap with standard driving datasets.
This paper investigates person detection and tracking in an industrial indoor workspace using a LiDAR mounted on an overhead crane. The overhead viewpoint introduces a strong domain shift from common vehicle-centric LiDAR benchmarks, and limited availability of suitable public training data. Henceforth, we curate a site-specific overhead LiDAR dataset with 3D human bounding-box annotations and adapt selected candidate 3D detectors under a unified training and evaluation protocol. We further integrate lightweight tracking-by-detection using AB3DMOT and SimpleTrack to maintain person identities over time. Detection performance is reported with distance-sliced evaluation to quantify the practical operating envelope of the sensing setup. The best adapted detector configurations achieve average precision (AP) up to 0.84 within a 5.0 m horizontal radius, increasing to 0.97 at 1.0 m, with VoxelNeXt and SECOND emerging as the most reliable backbones across this range. The acquired results contribute in bridging the domain gap between standard driving datasets and overhead sensing for person detection and tracking. We also report latency measurements, highlighting practical real-time feasibility. Finally, we release our dataset and implementations in GitHub to support further research