Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression Using Hidden Markov Models

Conference: IWAENC 2012 - International Workshop on Acoustic Signal Enhancement
09/04/2012 - 09/06/2012 at Aachen, Germany

Proceedings: IWAENC 2012

Pages: 4Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Vu, Dang Hai Tran; Haeb-Umbach, Reinhold (Department of Communications Engineering, University of Paderborn, 33098 Paderborn, Germany)

Recently, we introduced a unified approach for blind source separation and speech enhancement using microphone arrays [1]. Based on a sparseness model for the observations we derived an expectationmaximization (EM) algorithm for blind system identification and dominant source activity detection. In this contribution we improve this approach in the following two aspects: Firstly, we extend the observation model to a hidden Markov model (HMM) to explicitly exploit temporal correlations. Secondly, we include power based speech presence probability estimation. These modifications result in significantly improved spectrograms of the speaker presence probabilities after source separation and improved performance with respect to interference, noise and artefact reduction of the overall system, as measured on the dataset of the Signal Separation Evaluation Campaign 2010. Index Terms — Microphone array, blind source separation, speech enhancement, hidden Markov model