An Open Source Corpus and Recording Software for Distant Speech Recognition with the Microsoft Kinect

Conference: Speech Communication - 11. ITG-Fachtagung Sprachkommunikation
09/24/2014 - 09/26/2014 at Erlangen, Deutschland

Proceedings: Speech Communication

Pages: 4Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Authors:
Schnelle-Walka, Dirk; Radeck-Arneth, Stephan; Biemann, Chris; Radomski, Stefan (Telecooperation, Language Technology, Technische Universitaet Darmstadt, 64289 Darmstadt, Germany)

Abstract:
A basic requirement for improvements in distant speech recognition is the availability of a respective corpus of recorded utterances. With microphone arrays now available off-the-shelf as part of the Microsoft Kinect, a common recording device for such a corpus is wide-spread. In this paper, we introduce KiSRecord, an open source recording tool that can be used to alleviate this situation for data collection. Thus, we provide the first steps towards a community sourcing effort for a speech corpus recorded with de-facto standard microphone arrays.