Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression Using Hidden Markov Models

Konferenz: IWAENC 2012 - International Workshop on Acoustic Signal Enhancement
04.09.2012-06.09.2012 in Aachen, Germany

Tagungsband: IWAENC 2012

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Vu, Dang Hai Tran; Haeb-Umbach, Reinhold (Department of Communications Engineering, University of Paderborn, 33098 Paderborn, Germany)

Recently, we introduced a unified approach for blind source separation and speech enhancement using microphone arrays [1]. Based on a sparseness model for the observations we derived an expectationmaximization (EM) algorithm for blind system identification and dominant source activity detection. In this contribution we improve this approach in the following two aspects: Firstly, we extend the observation model to a hidden Markov model (HMM) to explicitly exploit temporal correlations. Secondly, we include power based speech presence probability estimation. These modifications result in significantly improved spectrograms of the speaker presence probabilities after source separation and improved performance with respect to interference, noise and artefact reduction of the overall system, as measured on the dataset of the Signal Separation Evaluation Campaign 2010. Index Terms — Microphone array, blind source separation, speech enhancement, hidden Markov model