Voice Activity Detection Based on Modulation-Phase Differences

Conference: Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
10/05/2016 - 10/07/2016 at Paderborn, Deutschland

Proceedings: Speech Communication

Pages: 5Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Authors:
Graf, Simon (Acoustic Speech Enhancement Research, Nuance Communications Deutschland GmbH, Ulm, Germany & Digital Signal Processing and System Theory, Christian-Albrechts-Universität zu Kiel, Kiel, Germany)
Herbig, Tobias; Buck, Markus (Acoustic Speech Enhancement Research, Nuance Communications Deutschland GmbH, Ulm, Germany)
Schmidt, Gerhard (Digital Signal Processing and System Theory, Christian-Albrechts-Universität zu Kiel, Kiel, Germany)

Abstract:
Many speech processing algorithms rely on voice activity detection (VAD) that separates speech from noise. For this task, several features have been introduced that employ different characteristic properties of speech. In this contribution, we introduce a new feature that is robust against various types of noise. By considering an alternating excitation structure of low and high frequencies, speech is detected with a high confidence. The computationally low complex feature can cope even with the limited spectral resolution that is typical for in-car-communication systems. By combining the feature with a conventional modulation feature, the performance can be improved. Our simulations confirm the robustness of the feature and show the increasing performance compared to established VAD features.