A Comparative Pronunciation Mapping Approach using G2P Conversion for Anglicisms in German Speech Recognition

Conference: Speech Communication - 14th ITG Conference
09/29/2021 - 10/01/2021, held online

Proceedings: ITG-Fb. 298: Speech Communication

Pages: 5
Language: English
Type: PDF

Authors:
Pritzen, Julia (Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Sankt Augustin, Germany & TH Köln - University of Applied Sciences, Köln, Germany)
Gref, Michael; Schmidt, Christoph (Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Sankt Augustin, Germany)
Zuehlke, Dietlind (TH Köln - University of Applied Sciences, Köln, Germany)

Abstract:
Anglicisms pose a challenge in German speech recognition because their pronunciation is irregular compared to native German words. To address this issue, we propose a comparative approach that uses both a German and an English grapheme-to-phoneme (G2P) model to create Anglicism pronunciations. By comparing the models' confidence measures, we choose the best resulting pronunciations and add them to an Anglicism pronunciation dictionary. To use English pronunciations within a German ASR model, we apply a phoneme mapping that transforms English phonemes into their most likely German equivalents. With our approach, we utilize the original pronunciations from the Anglicisms' source language while keeping the German Anglicism pronunciations with high accuracy. Tested on a dedicated Anglicism evaluation set, our approach improves the recognition of Anglicisms compared to a baseline model, reducing the word error rate by 1.33 % relative and the Anglicism error rate by 4.08 % relative.
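
The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The names G2PResult, EN_TO_DE, map_to_german and choose_pronunciation, the phoneme symbols, the mapping entries, and the confidence values are all hypothetical and chosen only for illustration. The sketch assumes each G2P model returns one phoneme sequence with a single confidence score, and that the two scores are directly comparable.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical English-to-German phoneme mapping. The symbols and entries are
# illustrative assumptions, not the mapping table used in the paper.
EN_TO_DE: Dict[str, str] = {
    "JH": "dZ",
    "AA": "O",
    "B": "b",
    "TH": "s",
    "W": "v",
}


@dataclass(frozen=True)
class G2PResult:
    """One pronunciation hypothesis from a G2P model (assumed output format)."""
    phonemes: Tuple[str, ...]
    confidence: float


def map_to_german(phonemes: Tuple[str, ...],
                  mapping: Dict[str, str] = EN_TO_DE) -> Tuple[str, ...]:
    """Replace each English phoneme with its mapped German equivalent.

    Phonemes without a mapping entry are passed through unchanged.
    """
    return tuple(mapping.get(p, p) for p in phonemes)


def choose_pronunciation(german: G2PResult,
                         english: G2PResult) -> Tuple[Tuple[str, ...], str]:
    """Pick the hypothesis with the higher confidence.

    English hypotheses are mapped to German phonemes before being returned.
    Ties fall back to the German hypothesis (an arbitrary choice in this sketch).
    """
    if english.confidence > german.confidence:
        return map_to_german(english.phonemes), "en"
    return german.phonemes, "de"


# Made-up hypotheses for the Anglicism "Job".
german_hyp = G2PResult(phonemes=("j", "o:", "p"), confidence=0.41)
english_hyp = G2PResult(phonemes=("JH", "AA", "B"), confidence=0.87)

pronunciation, source = choose_pronunciation(german_hyp, english_hyp)
lexicon_entry = "Job " + " ".join(pronunciation)
```

In this sketch the selected entry would be added to the Anglicism pronunciation dictionary in place of the German-only hypothesis. How confidences from two separately trained models are made comparable is left open here.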