Automatic Chat Transcription on a Firefighter TETRA Broadcast Channel

Conference: Sprachkommunikation - Beiträge zur 10. ITG-Fachtagung
September 26-28, 2012, Braunschweig, Germany

Proceedings: Sprachkommunikation

Pages: 4
Language: English
Type: PDF

Authors:
Stein, Daniel; Schwenninger, Jochen; Usabaev, Bela (Fraunhofer Institute for Intelligent Analysis and Information Systems, Schloss Birlinghoven, 53754 Sankt Augustin, Germany)

Abstract:
Reliable keyword extraction from firefighter radio communication requires a strong automatic speech recognition system. However, real-life data poses several challenges, such as a distorted voice signal, background noise and multiple speakers. In this paper, we review our experiences with the PRONTO corpus, which was recorded during a firefighting exercise. We then present benchmarks of our chat transcription system in terms of word error rate. Since a large number of sentences in public safety communication share similar patterns, we also analyse the impact of sentence complexity on system performance, and further investigate how many training utterances are needed for reliable speaker identification in this setting.
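
For readers unfamiliar with the metric named in the abstract: word error rate is conventionally computed as the word-level edit distance (substitutions, deletions and insertions) between a reference transcript and the recogniser output, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the authors' scoring tool. The function name and the example sentences are assumptions made for illustration and do not come from the PRONTO corpus.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                       # deletion of ref_word
                curr[j - 1] + 1,                   # insertion of hyp_word
                prev[j - 1] + substitution_cost,   # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical utterance: one substituted word and one deleted word
# against a four-word reference.
example_wer = word_error_rate("unit two entering building", "unit to entering")
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.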