Exploiting Temporal Correlation of Speech and Noise Magnitudes Using a Modified Kalman Filter for Speech Enhancement

Konferenz: Sprachkommunikation 2008 - 8. ITG-Fachtagung
08.10.2008 - 10.10.2008 in Aachen, Germany

Tagungsband: Sprachkommunikation 2008

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Esch, Thomas; Vary, Peter (Institute of Communication Systems and Data Processing, RWTH Aachen University, 52056 Aachen, Germany)

Inhalt:
A new speech enhancement algorithm using a modified Kalman filter in the frequency domain is proposed. The new approach consists of two steps. In the first step, the temporal trajectories of the speech and noise magnitudes are modeled by low order autoregressive (AR) processes, i.e., the current coefficients are propagated in time based on information taken from previous, enhanced coefficients, followed by a subsequent phase estimation. In the second step, the first estimation is updated. Therefore, two statistical estimators are utilized. The performance of the proposed method is shown to be considerably better than purely statistical estimators.