The paper introduces a robust human trajectory prediction method that uses self-supervised skeleton representation learning to handle the missing joints common in real-world environments. A skeleton representation model is pretrained with masked autoencoding, which makes it more robust to occlusions. Experiments in occlusion-prone scenarios show improved robustness to missing skeletal data without a loss in prediction accuracy, and the method consistently outperforms baseline models in clean-to-moderate missingness regimes.
Trajectory prediction models can now handle missing skeleton data in occluded environments thanks to a self-supervised pretraining approach that learns robust skeleton representations.
Human trajectory prediction plays a crucial role in applications such as autonomous navigation and video surveillance. While recent works have explored the integration of human skeleton sequences to complement trajectory information, skeleton data in real-world environments often suffer from missing joints caused by occlusions. These disturbances significantly degrade prediction accuracy, indicating the need for more robust skeleton representations. We propose a robust trajectory prediction method that incorporates a self-supervised skeleton representation model pretrained with masked autoencoding. Experimental results in occlusion-prone scenarios show that our method improves robustness to missing skeletal data without sacrificing prediction accuracy, and consistently outperforms baseline models in clean-to-moderate missingness regimes.
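To make the idea of masked autoencoding on skeletons concrete, here is a minimal sketch of the two ingredients such pretraining typically needs: hiding a random subset of joints, and scoring a reconstruction only on the hidden joints. The abstract does not specify the architecture, masking strategy, or loss, so everything here is an assumption for illustration: the function names `mask_joints` and `masked_mse`, the 2D joint format, the zero placeholder for hidden joints, and the uniform per-joint masking are all hypothetical and are not the authors' implementation.

```python
import random


def mask_joints(sequence, mask_ratio, rng):
    """Randomly hide joints in a skeleton sequence (illustrative assumption).

    sequence: list of frames; each frame is a list of (x, y) joint coordinates.
    Returns (masked_sequence, mask), where mask[t][j] is True if joint j of
    frame t was hidden. Hidden joints are replaced by the placeholder (0.0, 0.0).
    """
    masked, mask = [], []
    for frame in sequence:
        # Each joint is hidden independently with probability mask_ratio.
        frame_mask = [rng.random() < mask_ratio for _ in frame]
        masked.append(
            [(0.0, 0.0) if hidden else joint for joint, hidden in zip(frame, frame_mask)]
        )
        mask.append(frame_mask)
    return masked, mask


def masked_mse(prediction, target, mask):
    """Mean squared error computed over hidden joints only."""
    total, count = 0.0, 0
    for pred_frame, target_frame, frame_mask in zip(prediction, target, mask):
        for (px, py), (tx, ty), hidden in zip(pred_frame, target_frame, frame_mask):
            if hidden:
                total += (px - tx) ** 2 + (py - ty) ** 2
                count += 1
    # With no hidden joints there is nothing to reconstruct.
    return total / count if count else 0.0


if __name__ == "__main__":
    rng = random.Random(0)
    # Two frames with two joints each (toy data, not from the paper).
    skeleton = [[(1.0, 2.0), (3.0, 4.0)], [(5.0, 6.0), (7.0, 8.0)]]
    masked, mask = mask_joints(skeleton, mask_ratio=0.5, rng=rng)
    # A real encoder-decoder would map `masked` to a reconstruction; here the
    # masked input stands in as a trivial "prediction".
    print(masked_mse(masked, skeleton, mask))
```

In an actual pretraining loop, a learned encoder-decoder would receive the masked sequence and be trained to minimize a loss of this kind. Occlusions at test time then resemble the masking seen during pretraining, which is the intuition behind the robustness claim.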