Super-Wideband Extension of a Perceptual Based Echo Assessment Method for Aurally Adequate Evaluation of Residual Single Talk Echoes

Konferenz: Speech Communication - 13. ITG-Fachtagung Sprachkommunikation
10.10.2018 - 12.10.2018 in Oldenburg, Deutschland

Tagungsband: ITG-Fb. 282: Speech Communication

Seiten: 5Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Bleiholder, Stefan; Reimes, Jan; Kettler, Frank (HEAD acoustics GmbH, Herzogenrath, Germany)

Inhalt:
Speech communication beyond wideband is getting more and more common – necessitating sophisticated IP-based transmission technologies that provide higher acoustic bandwidths and introduce longer delays. These two effects significantly influence echo perception. Therefore, the results of a comprehensive third-party listening test conducted in a super-wideband scenario are used to extend the scope of an existing instrumental method for the prediction of echo perception. Furthermore, a modified prediction methodology using a Random Forest regression algorithm is introduced. The proposed model is trained with the auditory data from the super-wideband test corpus and the test corpora used in previous work on echo perception in narrowband and wideband scenarios. The combination of the new regression methodology and an improved echo analysis model provides estimated Mean Opinion Scores (MOS) for instrumental echo assessment in narrowband, wideband and super-wideband scenarios. The model shows very satisfying correlation to the underlying auditory data especially for the super-wideband case.