This paper investigates the impact of removing machine identity information at test time in anomalous sound detection (ASD) tasks. The authors demonstrate that the standard machine-wise evaluation protocol in ASD benchmarks can mask performance degradations and method-specific robustness differences when machine identity is unavailable during inference. By merging test recordings from multiple machines and evaluating them jointly, the study reveals that performance drops are correlated with the implicit machine identification accuracy of different ASD methods.
Blindly applying anomalous sound detection models trained with machine-specific data can lead to significant performance drops when the machine identity is unknown at test time, highlighting a critical gap in current ASD benchmarks.
Anomalous sound detection (ASD) benchmarks typically assume that the identity of the monitored machine is known at test time and that recordings are evaluated in a machine-wise manner. However, in realistic monitoring scenarios with multiple known machines operating concurrently, test recordings may not be reliably attributable to a specific machine, and requiring machine identity imposes deployment constraints such as dedicated sensors per machine. To reveal performance degradations and method-specific differences in robustness that are hidden under standard machine-wise evaluation, we consider a minimal modification of the ASD evaluation protocol in which test recordings from multiple machines are merged and evaluated jointly without access to machine identity at inference time. Training data and evaluation metrics remain unchanged, and machine identity labels are used only for post hoc evaluation. Experiments with representative ASD methods show that relaxing the known-identity assumption does expose such degradations and differences, and that the degradations are strongly related to implicit machine identification accuracy.
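The contrast between the two protocols can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each test clip has already been scored by every machine-specific model, and that, when identity is unknown, the clip's anomaly score is the minimum over those models, with the arg-min serving as the implicit machine identification. The fusion rule, the score-matrix layout, the toy numbers, and the names `auc`, `evaluate`, and `score_matrix` are all hypothetical choices made for illustration.

```python
import numpy as np


def auc(scores, labels):
    """AUC as the fraction of (anomalous, normal) pairs ranked correctly; ties count 0.5."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos, neg = scores[labels], scores[~labels]
    diff = pos[:, None] - neg[None, :]
    return float(((diff > 0).sum() + 0.5 * (diff == 0).sum()) / (len(pos) * len(neg)))


def evaluate(score_matrix, machine_ids, labels):
    """Compare identity-aware and identity-free scoring.

    score_matrix[i, m] is the anomaly score of clip i under machine m's model
    (assumed layout). machine_ids are used only to pick the identity-aware
    score and for post hoc grouping, never by the identity-free score.
    """
    score_matrix = np.asarray(score_matrix, dtype=float)
    machine_ids = np.asarray(machine_ids)
    labels = np.asarray(labels, dtype=bool)

    # Standard protocol: the clip is scored by the model of its true machine.
    known = score_matrix[np.arange(len(labels)), machine_ids]
    # Merged protocol (assumed rule): best-matching model decides the score.
    unknown = score_matrix.min(axis=1)
    predicted = score_matrix.argmin(axis=1)

    result = {"id_accuracy": float((predicted == machine_ids).mean())}
    for m in np.unique(machine_ids):
        sel = machine_ids == m
        # Per-machine metric, computed post hoc for both scoring variants.
        result[int(m)] = (auc(known[sel], labels[sel]), auc(unknown[sel], labels[sel]))
    return result


# Toy data: 2 machines, 4 clips each (made-up scores, not experimental results).
score_matrix = [
    [0.10, 0.90], [0.30, 0.80], [0.70, 0.20], [0.60, 0.90],   # machine 0
    [0.90, 0.20], [0.80, 0.10], [0.90, 0.70], [0.95, 0.80],   # machine 1
]
machine_ids = [0, 0, 0, 0, 1, 1, 1, 1]
labels = [0, 0, 1, 1, 0, 0, 1, 1]  # 1 = anomalous

result = evaluate(score_matrix, machine_ids, labels)
for key, value in result.items():
    print(key, value)
```

In this toy setup, one anomalous clip from machine 0 happens to look normal to machine 1's model, so the identity-free score misattributes it and machine 0's AUC falls from 1.0 to 0.75, while machine 1 is unaffected. This mirrors, on a small scale, the reported link between degradation and implicit machine identification accuracy.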