May 6, 2026arXiv:2605.04749

Spatial-Magnifier: Spatial upsampling for multichannel speech enhancement

Dongheon Lee, Ashutosh Pandey, Sanjeel Parekh, Daniel D. E. Wong, Jacob Donley, Buye Xu, Juan Azcarreta

AI Summary

The paper introduces Spatial-Magnifier, a neural network that generates virtual microphone signals from a limited set of real microphone measurements to improve the spatial directivity of multichannel speech enhancement. This is important because it addresses the physical constraints of fitting large microphone arrays into edge devices. Experiments show that Spatial-Magnifier, combined with the Spatial Audio Representation Learning (SARL) framework, outperforms existing spatial upsampling baselines and nearly recovers oracle performance with all microphones.

Key Contribution

Unlock near-oracle speech enhancement performance from compact microphone arrays by virtually expanding their spatial coverage with a novel neural network.

Abstract

While the spatial directivity of multichannel speech enhancement algorithms improves with the number of microphones, fitting large capture arrays into real-world edge devices is typically limited by physical constraints. To overcome this limitation, we propose Spatial-Magnifier, a neural network designed to generate virtual microphone (VM) signals from a limited set of real microphone (RM) measurements. Moreover, we introduce the Spatial Audio Representation Learning (SARL) framework, which leverages estimated VM signals and features to condition a downstream speech enhancement system. Experimental results demonstrate that the proposed framework outperforms existing spatial upsampling baselines across various speech extraction systems, including end-to-end multichannel speech enhancement and neural beamforming. The proposed method nearly recovers the oracle performance achieved when all microphones are available.

Architecture Design (Transformers, SSMs, MoE)Speech & Audio Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References34

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Spatial-Magnifier: Spatial upsampling for multichannel speech enhancement

Related Papers