Detection of Audio Events with Repetitive Structure Using Generalized Autocorrelations

Conference: Speech Communication - 11. ITG-Fachtagung Sprachkommunikation
09/24/2014 - 09/26/2014 at Erlangen, Deutschland

Proceedings: Speech Communication

Pages: 4Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Authors:
Kurth, Frank; Cornaggia-Urrigshardt, Alessia (Fraunhofer FKIE, 53343 Wachtberg, Germany)

Abstract:
We review several signal transforms for representing repeating structures within audio signals in the timefrequency domain. Based on a recently introduced generalized autocorrelation, the shift-ACF, we demonstrate how multiply repeated audio events may be better represented, hence improving detection performance. Using different examples from audio monitoring, we show how such signal transforms can be applied for audio event detection tasks in realistic scenarios. As a particular example we report on recent evaluations on speech detection in noisy recordings.