Objective Assessment of Artificial Speech Bandwidth Extension Approaches

Konferenz: Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
05.10.2016 - 07.10.2016 in Paderborn, Deutschland

Tagungsband: ITG-Fb. 267: Speech Communication

Seiten: 5Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Abel, Johannes; Fingscheidt, Tim (Institute for Communications Technology, Technische Universität Braunschweig, 38106 Braunschweig, Germany)
Kaniewska, Magdalena; Guillaume, Cyril; Tirry, Wouter (NXP Software, Leuven, Belgium)

Inhalt:
Demonstrated in various prior investigations, wideband PESQ and POLQA are not capable to predict speech quality of artificial speech bandwidth extension (ABE) test conditions accurately enough. Based on a multilingual subjective listening test, evaluating ABE solutions from six different institutions, we present a novel objective measure particularly being able to estimate the speech quality of ABE-processed speech signals. The proposed measure employs a variety of features for ABE-related distortion detection. A combination of perceptually and non-perceptually influenced features are used as input to a support-vector-machine-based high-performance statistical model for establishing a link to an accurate speech quality estimate. Evaluation of the measure employs the ITU-Trecommended metrics correlation and rank order. Testing the predictive power of the proposed measure is done in three crossvaldation setups: Disjoint languages, disjoint speakers, and disjoint ABE solutions. In a comparison, the proposed measure outperforms wideband PESQ and POLQA both in correlation and rank order.