Mar 3, 2026arXiv:2603.02724

Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids

Mathuranathan Mayuravaani, W. Kleijn, W. Bastiaan Kleijn, A. Lensen, Andrew Lensen, Charlotte Sørensen, Charlotte Sorensen

AI Summary

This paper introduces a simulation-based data augmentation strategy using acoustic transfer functions (ATFs) to train a transformer-based classifier for single-microphone own voice detection (OVD) in hearing aids. The approach involves training on analytically generated ATFs and fine-tuning on numerically simulated ATFs from a rigid-sphere model to a detailed head-and-torso representation. Results show high accuracy on both simulated (95.52%) and real-world hearing aid recordings (80%), demonstrating the model's generalization capability.

Key Contribution

Achieve 80% accuracy on real-world hearing aid recordings for own voice detection using a single microphone, without real-world training data, by cleverly simulating acoustic transfer functions.

Abstract

This paper presents a simulation-based approach to own voice detection (OVD) in hearing aids using a single microphone. While OVD can significantly improve user comfort and speech intelligibility, existing solutions often rely on multiple microphones or additional sensors, increasing device complexity and cost. To enable ML-based OVD without requiring costly transfer-function measurements, we propose a data augmentation strategy based on simulated acoustic transfer functions (ATFs) that expose the model to a wide range of spatial propagation conditions. A transformer-based classifier is first trained on analytically generated ATFs and then progressively fine-tuned using numerically simulated ATFs, transitioning from a rigid-sphere model to a detailed head-and-torso representation. This hierarchical adaptation enabled the model to refine its spatial understanding while maintaining generalization. Experimental results show 95.52% accuracy on simulated head-and-torso test data. Under short-duration conditions, the model maintained 90.02% accuracy with one-second utterances. On real hearing aid recordings, the model achieved 80% accuracy without fine-tuning, aided by lightweight test-time feature compensation. This highlights the model's ability to generalize from simulated to real-world conditions, demonstrating practical viability and pointing toward a promising direction for future hearing aid design.

Data Curation & Synthetic Data Speech & Audio

Citation Metrics

Citations0

Influential citations0

References44

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids

Related Papers