Linear Combining of Audio Features for Signal Classification in Ad-hoc Microphone Arrays

Konferenz: Speech Communication - 11. ITG-Fachtagung Sprachkommunikation
24.09.2014 - 26.09.2014 in Erlangen, Deutschland

Tagungsband: Speech Communication

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Gergen, Sebastian; Martin, Rainer (Institute of Communication Acoustics, Ruhr-Universitaet Bochum, 44780 Bochum, Germany)

Inhalt:
Audio signals are often corrupted by signal contributions from competing sources and reverberation in the acoustic environment. In an audio signal classification task these effects introduce a mismatch between test and training data, which decreases the classification accuracy. When multiple sources are simultaneously active and captured by multiple ad-hoc distributed microphones in a room, it is of interest to determine the type of each source based on the captured signal mixtures. Obviously, the microphones closest to a particular source are most suitable for its classification. However, it is not clear how to combine signal features extracted from the microphone signals in an ad-hoc array in order to classify the source signals reliably. In this contribution different data combination strategies are introduced. The resulting classification performance is analyzed based on simulations and audio recordings. When information from microphones within the critical distance of a source is combined with information from the other microphones in the room, a high classification accuracy can be obtained.