Detection of Audio Events with Repetitive Structure Using Generalized Autocorrelations
Konferenz: Speech Communication - 11. ITG-Fachtagung Sprachkommunikation
24.09.2014 - 26.09.2014 in Erlangen, Deutschland
Tagungsband: Speech Communication
Seiten: 4Sprache: EnglischTyp: PDF
Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt
Autoren:
Kurth, Frank; Cornaggia-Urrigshardt, Alessia (Fraunhofer FKIE, 53343 Wachtberg, Germany)
Inhalt:
We review several signal transforms for representing repeating structures within audio signals in the timefrequency domain. Based on a recently introduced generalized autocorrelation, the shift-ACF, we demonstrate how multiply repeated audio events may be better represented, hence improving detection performance. Using different examples from audio monitoring, we show how such signal transforms can be applied for audio event detection tasks in realistic scenarios. As a particular example we report on recent evaluations on speech detection in noisy recordings.