Evaluation of a Speech Bandwidth Extension Algorithm Based on Vocal Tract Shape Estimation

Conference: IWAENC 2012 - International Workshop on Acoustic Signal Enhancement
09/04/2012 - 09/06/2012 at Aachen, Germany

Proceedings: IWAENC 2012

Pages: 4
Language: English
Type: PDF

Authors:
Katsir, Itai; Malah, David; Cohen, Israel (Department of Electrical Engineering, Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel)

Abstract:
In this paper, we evaluate a speech bandwidth extension (BWE) algorithm that involves phonetic and speaker-dependent estimation of the high-band part of the spectral envelope. The BWE algorithm extracts speech phoneme information using a hidden Markov model. Speaker vocal tract shape information corresponding to the wideband signal is extracted by a codebook search. Postprocessing of the estimated vocal tract shape by iterative tuning reduces artifacts in cases of erroneous estimation of the speech phoneme or vocal tract shape. We present objective measurement results demonstrating the benefit of the iterative tuning. Subjective listening tests illustrate improved wideband quality in comparison to the input narrowband speech. The algorithm complexity is also analyzed.

Index Terms: Bandwidth extension, speech processing, vocal tract area function, sensitivity function, MUSHRA.
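The codebook search mentioned in the abstract can be pictured, in its most generic form, as a nearest-neighbor lookup. The sketch below is only an illustration under stated assumptions and is not the authors' implementation: the function name, the use of squared Euclidean distance, the pairing of a narrowband feature vector with a wideband vocal tract area function, and all numeric values are hypothetical.

```python
import math


def nearest_codebook_entry(query, codebook):
    """Return (index, paired_shape) for the codebook key closest to `query`.

    `codebook` is a list of (key, shape) pairs. Both the pair layout and the
    squared Euclidean distance are assumptions made for this illustration.
    """
    if not codebook:
        raise ValueError("codebook must contain at least one entry")
    best_index = -1
    best_dist = math.inf
    for i, (key, _shape) in enumerate(codebook):
        # Squared Euclidean distance between the query and this key.
        dist = sum((q - k) ** 2 for q, k in zip(query, key))
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index, codebook[best_index][1]


# Toy codebook: (narrowband feature vector, wideband area-function values).
# All numbers are made up purely for illustration.
toy_codebook = [
    ([0.0, 0.0], [1.0, 2.0, 3.0]),
    ([1.0, 1.0], [2.0, 2.5, 1.5]),
    ([4.0, 0.5], [0.5, 1.0, 4.0]),
]

index, shape = nearest_codebook_entry([0.9, 1.2], toy_codebook)
```

Here the query is closest to the second key, so the lookup returns that entry's paired shape. A real system would presumably use trained codebooks and a perceptually motivated distance; neither is specified in this listing.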