Advances In End-to-End Conversational Speech Quality Prediction

Conference: Speech Communication - 15th ITG Conference
09/20/2023 - 09/22/2023 at Aachen

doi:10.30420/456164012

Proceedings: ITG-Fb. 312: Speech Communication

Pages: 5
Language: English
Type: PDF

Authors:
Bleiholder, Stefan; Rohrer, Nils (HEAD acoustics GmbH, Herzogenrath, Germany)
Kettler, Christian; Weyer, Steffen (Electrical Engineering and IT, FH Aachen, Aachen, Germany)

Abstract:
A model for predicting the overall end-to-end conversational speech quality (MOS_E2E,G) of a telecommunication connection was developed recently and validated by conversational tests (CT) in a VoIP scenario using handsets. This contribution presents a CT with expert subjects, conducted to adapt the model to hands-free speakerphones (HFT) in vehicles. A configurable HFT evaluation board was used to replicate typical impairments. The expert test subjects rated the conversational speech quality on both sides of the connection. The auditory results provide information about the interrelations between the different quality dimensions relevant to perceived speech quality and indicate how the model should be adapted for this use case. Laboratory tests with the HFT implementations provided the acoustic parameters required for the model prediction under different conditions (e.g. with and without additional driving noise). The results are promising and justify a more extensive CT with naïve test subjects.