Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model

Konferenz: Speech Communication - 15th ITG Conference
20.09.2023-22.09.2023 in Aachen

doi:10.30420/456164022

Tagungsband: ITG-Fb. 312: Speech Communication

Seiten: 5Sprache: EnglischTyp: PDF

Autoren:
Lemercier, Jean-Marie; Gerkmann, Timo (Universität Hamburg, Germany)
Thiemann, Joachim; Koning, Raphael (Advanced Bionics, Hannover, Germany)

Inhalt:
In this paper we present a method for single-channel wind noise reduction using our previously proposed diffusionbased stochastic regeneration model combining predictive and generative modelling. We introduce a non-additive speech in noise model to account for the non-linear deformation of the membrane caused by the wind flow and possible clipping. We show that our stochastic regeneration model outperforms other neural-network-based wind noise reduction methods as well as purely predictive and generative models, on a dataset using simulated and realrecorded wind noise. We further show that the proposed method generalizes well by testing on an unseen dataset with real-recorded wind noise. Audio samples, data generation scripts and code for the proposed methods can be found online.