Comparison of Different Approaches for Speech Recognition in Hands-free Mode

Konferenz: Sprachkommunikation - Beiträge zur 10. ITG-Fachtagung
26.09.2012 - 28.09.2012 in Braunschweig, Deutschland

Tagungsband: Sprachkommunikation

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Hirsch, Hans-Günter (Institute for Pattern Recognition, Niederrhein University of Applied Sciences, Krefeld, Germany)
Ganapathy, Sriram; Hermansky, Hynek (Center for Language and Speech Processing, Johns Hopkins University, Baltimore, USA)

Inhalt:
To improve speech recognition in case of a hands-free speech input we apply the signal processing technique known as frequency domain linear prediction (FDLP). By analyzing the effects of reverberation we prove that this method is well suited to create robust acoustic features for this mode of speech input. Furthermore, we compare the efficiency of this robust feature extraction scheme with the alternative approach of adapting the Hidden Markov Models (HMMs). We especially investigate the influence of a varying distance between the speaker and the microphone in a reverberant room.