Evaluation of Enhanced F0-Trajectories for Speech Detection and Classification in Acoustic Monitoring

Conference: Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
10/05/2016 - 10/07/2016 at Paderborn, Germany

Proceedings: Speech Communication

Pages: 4
Language: English
Type: PDF

Authors:
Kurth, Frank; Cornaggia-Urrigshardt, Alessia (Fraunhofer FKIE, Communication Systems, Fraunhoferstr. 20, 53343 Wachtberg, Germany)

Abstract:
We evaluate the performance of enhanced F0-features for robustly detecting speech segments in noisy acoustic monitoring recordings. F0-features are extracted from the spectrogram using the recently introduced shift autocorrelation (shift-ACF), followed by trajectory extraction. Speech detection is performed in a two-stage approach comprising a classification stage and a segment extraction stage. We systematically evaluate the shift-ACF features and the speech detection performance using (i) purely synthetic data, (ii) a mix of synthetic speech and real background noise, and (iii) real speech and real background noise. A review of their strengths and weaknesses shows that shift-ACF-based F0-features outperform classical features in several scenarios.
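
The abstract does not specify how the segment extraction stage works, so the following is only a minimal illustrative sketch of one common way to turn frame-wise classifier decisions into speech segments. It should not be read as the authors' method. The function name `extract_segments` and the parameters `min_frames` and `max_gap` are hypothetical and chosen for illustration only.

```python
from typing import List, Sequence, Tuple


def extract_segments(
    frame_is_speech: Sequence[int],
    min_frames: int = 3,   # hypothetical: shortest segment to keep, in frames
    max_gap: int = 2,      # hypothetical: largest non-speech gap to bridge, in frames
) -> List[Tuple[int, int]]:
    """Convert per-frame speech decisions into (start, end) segments.

    Frame indices are zero-based and `end` is exclusive.
    """
    # Step 1: collect maximal runs of consecutive speech frames.
    runs: List[Tuple[int, int]] = []
    start = None
    for i, flag in enumerate(frame_is_speech):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(frame_is_speech)))

    # Step 2: merge runs separated by a gap of at most `max_gap` frames.
    merged: List[Tuple[int, int]] = []
    for seg_start, seg_end in runs:
        if merged and seg_start - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], seg_end)
        else:
            merged.append((seg_start, seg_end))

    # Step 3: discard segments shorter than `min_frames`.
    return [(s, e) for s, e in merged if e - s >= min_frames]


if __name__ == "__main__":
    decisions = [0, 1, 1, 0, 1, 1, 1, 0, 0, 0, 0, 1, 0, 0]
    print(extract_segments(decisions))
```

In this sketch, short gaps between speech runs are bridged before short segments are discarded, so a brief dropout in the classifier output does not split one utterance into fragments that are then thrown away.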