This paper introduces a simulation-based data augmentation strategy that uses acoustic transfer functions (ATFs) to train a transformer-based classifier for single-microphone own voice detection (OVD) in hearing aids. The classifier is first trained on analytically generated ATFs and then progressively fine-tuned on numerically simulated ATFs, moving from a rigid-sphere model to a detailed head-and-torso representation. It reaches 95.52% accuracy on simulated head-and-torso test data and 80% on real hearing aid recordings without fine-tuning on them, indicating that the model generalizes from simulated to real-world conditions.
Reaches 80% accuracy on real-world hearing aid recordings for single-microphone own voice detection, without real-world training data, by augmenting training with simulated acoustic transfer functions.
This paper presents a simulation-based approach to own voice detection (OVD) in hearing aids using a single microphone. While OVD can significantly improve user comfort and speech intelligibility, existing solutions often rely on multiple microphones or additional sensors, increasing device complexity and cost. To enable ML-based OVD without requiring costly transfer-function measurements, we propose a data augmentation strategy based on simulated acoustic transfer functions (ATFs) that expose the model to a wide range of spatial propagation conditions. A transformer-based classifier is first trained on analytically generated ATFs and then progressively fine-tuned using numerically simulated ATFs, transitioning from a rigid-sphere model to a detailed head-and-torso representation. This hierarchical adaptation enabled the model to refine its spatial understanding while maintaining generalization. Experimental results show 95.52% accuracy on simulated head-and-torso test data. Under short-duration conditions, the model maintained 90.02% accuracy with one-second utterances. On real hearing aid recordings, the model achieved 80% accuracy without fine-tuning, aided by lightweight test-time feature compensation. This highlights the model's ability to generalize from simulated to real-world conditions, demonstrating practical viability and pointing toward a promising direction for future hearing aid design.
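To make the augmentation idea concrete, the following is a minimal sketch of applying a simulated ATF to a clean signal in the frequency domain. It is not the paper's pipeline: the abstract does not specify the analytical model, so this sketch assumes the simplest possible one, a free-field point source with transfer function exp(-jkr) / (4πr). The function names (`free_field_atf`, `apply_atf`, `augment`), the distance range, and the speed of sound are all illustrative assumptions.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def free_field_atf(freqs_hz, distance_m):
    """Free-field point-source transfer function exp(-j*k*r) / (4*pi*r).

    Assumption: stands in for the paper's analytical ATFs, which the
    abstract does not describe in detail.
    """
    k = 2.0 * np.pi * np.asarray(freqs_hz) / SPEED_OF_SOUND  # wavenumber
    return np.exp(-1j * k * distance_m) / (4.0 * np.pi * distance_m)


def apply_atf(signal, sample_rate, distance_m):
    """Filter a mono signal with the free-field ATF for one source distance."""
    # Zero-pad so the propagation delay does not wrap around circularly.
    delay_samples = int(np.ceil(distance_m / SPEED_OF_SOUND * sample_rate))
    n_fft = len(signal) + delay_samples + 1
    spectrum = np.fft.rfft(signal, n=n_fft)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    return np.fft.irfft(spectrum * free_field_atf(freqs, distance_m), n=n_fft)


def augment(signal, sample_rate, rng, n_copies=4, r_min=0.05, r_max=2.0):
    """Return (distance, filtered_signal) pairs for randomly drawn distances.

    The distance range is a hypothetical choice for illustration only.
    """
    distances = rng.uniform(r_min, r_max, size=n_copies)
    return [(float(d), apply_atf(signal, sample_rate, float(d))) for d in distances]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.standard_normal(16000)  # one second of placeholder "speech"
    for distance, filtered in augment(clean, 16000, rng):
        print(f"r = {distance:.2f} m, output length = {len(filtered)}")
```

A rigid-sphere or head-and-torso ATF would replace `free_field_atf` with a table of simulated frequency responses, while the filtering step in `apply_atf` would stay essentially the same.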