Search papers, labs, and topics across Lattice.
This paper investigates using image inpainting as an auxiliary task to improve the ability of deep learning models to extract individual-specific visual embeddings from animal skin patterns, specifically for zebrafish identification. They compare four encoder backbones, training them with a masked image reconstruction objective alongside the individual identification task. Results show that incorporating inpainting improves embedding clustering metrics and focuses attention on relevant skin patterns, as evidenced by GradCAM visualizations, leading to better individual identification.
For more reliable animal identification, force your model to reconstruct masked skin patterns, and it will learn embeddings that better capture individual differences.
In this paper, we explore deep learning techniques for individual identification of animals based on their skin patterns. Individual identification is crucial in biodiversity monitoring, since it enables analysis of decline or growth of populations, or intra-species interactions within populations. Models trained for the task of individual identification often do not focus on the skin pattern of animals, but on background details or body shape details. These characteristics are not individually specific, or can change drastically through time. We focus on techniques that will make machine learning models more responsive to skin pattern structure when extracting individual visual embeddings from images. For this, we explore image inpainting of task-specific masks as an auxiliary task to enhance ML-based individual identification from animal skin patterns. We propose a comparative analysis among four models as an encoder backbone for the individual identification task. We focus on the case study of zebrafish, which is a widely recognized biological model organism, and which exhibits individually identifying skin patterns. To evaluate encoder backbone performance, we present standard metrics for classification accuracy, embedding clustering metrics, and GradCAM visualizations.