HMM Embedded Conditional Vector Estimation Applied to Noisy Line Spectral Frequencies

Konferenz: Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
05.10.2016 - 07.10.2016 in Paderborn, Deutschland

Tagungsband: ITG-Fb. 267: Speech Communication

Seiten: 5Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Klein, Andre; Feldes, Stefan (Institute of Digital Signal Processing, University of Applied Sciences Mannheim, 68163 Mannheim, Germany)

Conditional Bayesian estimation is based on disturbed observations, some probability measure of their reliability and a-priori knowledge of the original’s statistics. In this paper an extended conditional vector estimator is developed to account for time-varying statistics of non-stationary vector sources. To this end we introduce an outer HMM, that allows modeling statistics, e.g., phoneme specifically, with particular multivariate Gaussian emissions to provide temporal and spatial correlations for the inner estimator. The scheme is exemplarily applied to enhance noisy line spectral frequencies (LSF). Training and evaluation is done based on an extensive speech database and an AWGN channel model. The results show consistent improvements, with higher gains for low channel SNR.