Language Recognition for SSB modulated HF Radio Signals of Short Duration

Conference: Speech Communication - 15th ITG Conference
20.09.2023-22.09.2023 in Aachen

doi:10.30420/456164047

Proceedings: ITG-Fb. 312: Speech Communication

Pages: 5
Language: English
Type: PDF

Authors:
Cornaggia-Urrigshardt, Alessia; Fritz, Fabian; Henneke, Lukas; Kurth, Frank; Schlich, Christian; Wilkinghoff, Kevin (Fraunhofer FKIE, Wachtberg, Germany)

Abstract:
Language recognition is an important processing step in many speech-related monitoring applications. However, in situations where the quality of the signals is severely degraded, as is the case for audio signals obtained from radio frequency (RF) transmissions, reliably recognizing the spoken language may be challenging. This is especially true for applications where the speech segments are mostly of short duration. In this work, we focus on language recognition for the demanding case of RF transmissions in the high frequency (HF) band, which are transmitted using single-sideband modulation with suppressed carrier. We apply publicly available state-of-the-art approaches to this task and compare their performance as a function of the input speech segment length on a dataset of HF signals recorded in the wild (i.e., natural speech recorded in realistic transmission conditions). Furthermore, a domain adaptation technique is proposed for this HF radio signal scenario and evaluated experimentally.