An Open Source Corpus and Recording Software for Distant Speech Recognition with the Microsoft Kinect

Konferenz: Speech Communication - 11. ITG-Fachtagung Sprachkommunikation
24.09.2014 - 26.09.2014 in Erlangen, Deutschland

Tagungsband: Speech Communication

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Schnelle-Walka, Dirk; Radeck-Arneth, Stephan; Biemann, Chris; Radomski, Stefan (Telecooperation, Language Technology, Technische Universitaet Darmstadt, 64289 Darmstadt, Germany)

Inhalt:
A basic requirement for improvements in distant speech recognition is the availability of a respective corpus of recorded utterances. With microphone arrays now available off-the-shelf as part of the Microsoft Kinect, a common recording device for such a corpus is wide-spread. In this paper, we introduce KiSRecord, an open source recording tool that can be used to alleviate this situation for data collection. Thus, we provide the first steps towards a community sourcing effort for a speech corpus recorded with de-facto standard microphone arrays.