Mar 3, 2026 · arXiv:2603.02794

Differentiable Time-Varying IIR Filtering for Real-Time Speech Denoising

R.P. Rota, Riccardo Rota, Kiril Ratmanski, Jozef Coldenhoff, Milos Cernak

AI Summary

The paper introduces Time-Varying Filtering (TVF), a 1M-parameter speech enhancement model that combines deep learning with interpretable Digital Signal Processing (DSP) by predicting coefficients for a differentiable 35-band IIR filter cascade. This allows the model to adapt to non-stationary noise in real time while maintaining an interpretable processing chain. Experiments on the Valentini-Botinhao dataset compare TVF against a static DDSP approach and a fully deep-learning-based solution on a speech denoising task.

Key Contribution

Achieves real-time speech denoising with a 1M-parameter model that offers the interpretability of DSP and the adaptability of deep learning.

Abstract

We present TVF (Time-Varying Filtering), a low-latency speech enhancement model with 1 million parameters. Combining the interpretability of Digital Signal Processing (DSP) with the adaptability of deep learning, TVF bridges the gap between traditional filtering and modern neural speech modeling. The model utilizes a lightweight neural network backbone to predict the coefficients of a differentiable 35-band IIR filter cascade in real time, allowing it to adapt dynamically to non-stationary noise. Unlike "black-box" deep learning approaches, TVF offers a completely interpretable processing chain, where spectral modifications are explicit and adjustable. We demonstrate the efficacy of this approach on a speech denoising task using the Valentini-Botinhao dataset and compare the results to a static DDSP approach and a fully deep-learning-based solution, showing that TVF achieves effective adaptation to changing noise conditions.
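
The signal path described in the abstract (a cascade of IIR band filters whose coefficients are refreshed over time) can be illustrated with a small sketch. The abstract does not specify the filter topology, update rate, or band layout, so everything below is an assumption made for illustration: peaking biquads from the widely used RBJ Audio EQ Cookbook formulas, one gain per band per frame, a fixed Q, and the hypothetical names `peaking_biquad` and `time_varying_cascade`. The `frame_gains_db` argument stands in for what a neural backbone might predict. The code is plain Python and not differentiable; it only shows how filter state is carried across coefficient changes.

```python
import math

def peaking_biquad(fc, gain_db, q, fs):
    """Peaking-EQ biquad (RBJ cookbook), normalized so that a0 == 1.

    Returns (b, a) with b = [b0, b1, b2] and a = [a1, a2].
    Assumed filter type; the paper's actual band filters may differ.
    """
    amp = 10.0 ** (gain_db / 40.0)
    w0 = 2.0 * math.pi * fc / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha / amp
    b = [(1.0 + alpha * amp) / a0, -2.0 * math.cos(w0) / a0, (1.0 - alpha * amp) / a0]
    a = [-2.0 * math.cos(w0) / a0, (1.0 - alpha / amp) / a0]
    return b, a

def time_varying_cascade(x, frame_gains_db, centers_hz, fs=16000, hop=256, q=2.0):
    """Run x through a cascade of biquads whose gains change once per frame.

    frame_gains_db[t][k] is the gain (dB) of band k during frame t; here it is
    a hypothetical stand-in for the output of a coefficient-predicting network.
    Filter state is kept across frames so coefficient updates do not reset it.
    """
    state = [[0.0, 0.0] for _ in centers_hz]  # transposed direct form II state
    coeffs = []
    y = []
    for n, sample in enumerate(x):
        if n % hop == 0:
            # Refresh coefficients at each frame boundary (hold the last frame).
            t = min(n // hop, len(frame_gains_db) - 1)
            coeffs = [peaking_biquad(fc, g, q, fs)
                      for fc, g in zip(centers_hz, frame_gains_db[t])]
        v = sample
        for k, (b, a) in enumerate(coeffs):
            s = state[k]
            out = b[0] * v + s[0]
            s[0] = b[1] * v - a[0] * out + s[1]
            s[1] = b[2] * v - a[1] * out
            v = out  # output of band k feeds band k + 1
        y.append(v)
    return y
```

With all gains at 0 dB every section reduces to a pass-through, and lowering one band's gain in a later frame attenuates only the content near that band's center frequency, which is what makes each spectral modification explicit and inspectable. A trainable version would express the same recursion in an automatic-differentiation framework so that gradients reach the predicted coefficients.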

Architecture Design (Transformers, SSMs, MoE) · Interpretability & Mechanistic Interp · Speech & Audio

Citation Metrics

Citations: 0

Influential citations: 0

References: 34

Year: 2026

Venue: N/A
