Mar 17, 2026arXiv:2603.16816

WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation

Muhammad Aamir, Naoya Muramatsu, Sangyun Shin, Matthew Wijers, Jiaxing Jhong, Xinyu Hou, Amir Patel, Andrew Markham

AI Summary

The authors introduce WildDepth, a new multimodal dataset containing synchronized RGB and LiDAR data for diverse animal species in both domestic and wild settings, designed to address the lack of metrically scaled animal datasets for depth estimation. Experiments demonstrate that incorporating LiDAR data improves depth estimation accuracy by 10% RMSE and 3D reconstruction fidelity by 12% in Chamfer distance compared to RGB-only approaches. The release of WildDepth aims to promote the development of robust multimodal perception systems applicable across different environments.

Key Contribution

LiDAR data boosts animal depth estimation accuracy by 10% RMSE, revealing the power of multimodal data for 3D wildlife perception.

Abstract

Depth estimation and 3D reconstruction have been extensively studied as core topics in computer vision. Starting from rigid objects with relatively simple geometric shapes, such as vehicles, the research has expanded to address general objects, including challenging deformable objects, such as humans and animals. However, for the animal, in particular, the majority of existing models are trained based on datasets without metric scale, which can help validate image-only models. To address this limitation, we present WildDepth, a multimodal dataset and benchmark suite for depth estimation, behavior detection, and 3D reconstruction from diverse categories of animals ranging from domestic to wild environments with synchronized RGB and LiDAR. Experimental results show that the use of multi-modal data improves depth reliability by up to 10% RMSE, while RGB-LiDAR fusion enhances 3D reconstruction fidelity by 12% in Chamfer distance. By releasing WildDepth and its benchmarks, we aim to foster robust multimodal perception systems that generalize across domains.

Computer Vision Data Curation & Synthetic Data Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation

Related Papers