Advances In End-to-End Conversational Speech Quality Prediction

Konferenz: Speech Communication - 15th ITG Conference
20.09.2023-22.09.2023 in Aachen

doi:10.30420/456164012

Tagungsband: ITG-Fb. 312: Speech Communication

Seiten: 5Sprache: EnglischTyp: PDF

Autoren:
Bleiholder, Stefan; Rohrer, Nils (HEAD acoustics GmbH, Herzogenrath, Germany)
Kettler, Christian; Weyer, Steffen (Electrical Engineering and IT, FH Aachen, Aachen Germany)

Inhalt:
A model for the prediction of the overall end-to-end conversational speech quality (MOSE2E,G) in a telecommunication connection was developed recently, and validated by conversational tests (CT) in a VoIP scenario using handsets. This contribution now presents an expert subjects CT for the adaptation of the model using handsfree speakerphones (HFT) in vehicles. A configurable HFT evaluation board was used to replicate typical impairments. The expert test subjects rated the conversational speech quality on both sides of the connection. The auditory results provide information about the interrelations between different quality dimensions relevant for perceived speech quality and give advice to adapt the model for this use case. Laboratory tests with the HFT implementations gathered the necessary acoustic parameters in different conditions (e.g. with and without additional driving noise) for the model prediction. The results are promising and justify a more extensive CT with naïve test subjects.