Detection of Audio Events with Repetitive Structure Using Generalized Autocorrelations

Conference: Speech Communication - 11. ITG-Fachtagung Sprachkommunikation
September 24-26, 2014, Erlangen, Germany

Proceedings: Speech Communication

Pages: 4
Language: English
Type: PDF

Authors:
Kurth, Frank; Cornaggia-Urrigshardt, Alessia (Fraunhofer FKIE, 53343 Wachtberg, Germany)

Abstract:
We review several signal transforms for representing repeating structures within audio signals in the time-frequency domain. Based on a recently introduced generalized autocorrelation, the shift-ACF, we demonstrate how multiply repeated audio events can be represented more clearly, which in turn improves detection performance. Using different examples from audio monitoring, we show how such signal transforms can be applied to audio event detection tasks in realistic scenarios. As a particular example, we report on recent evaluations of speech detection in noisy recordings.
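
For orientation, the following minimal sketch shows how a classical autocorrelation (ACF) exposes a repetition period in a signal envelope. It is not the shift-ACF discussed in the paper, whose definition is not given in this abstract. The function name `repetition_lag`, the parameter `min_lag`, and the synthetic envelope are all hypothetical and chosen only for illustration.

```python
import numpy as np

def repetition_lag(envelope, min_lag=1):
    """Return the lag (in samples) of the strongest classical ACF peak, plus the ACF.

    Illustrative only: this is the ordinary autocorrelation,
    not the generalized shift-ACF from the paper.
    """
    x = np.asarray(envelope, dtype=float)
    x = x - x.mean()
    # Full linear autocorrelation; keep the non-negative lags only.
    acf = np.correlate(x, x, mode="full")[len(x) - 1:]
    acf = acf / acf[0]  # normalize so that lag 0 equals 1
    # Skip the smallest lags, where the ACF reflects the event's own width.
    return min_lag + int(np.argmax(acf[min_lag:])), acf

# Synthetic envelope (an assumption): one event every 50 samples plus weak noise.
rng = np.random.default_rng(0)
envelope = 0.01 * rng.standard_normal(500)
envelope[::50] += 1.0

lag, acf = repetition_lag(envelope, min_lag=5)
print("estimated repetition period:", lag, "samples")
```

In this toy setting the strongest ACF peak away from lag 0 sits at the repetition period. Each further repetition contributes to fewer overlapping pairs at larger lags, which gives one intuition for why a transform designed for multiply repeated events might represent them better.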