Densely Connected Convolutional Networks for Speech Recognition

Conference: Speech Communication - 13. ITG-Fachtagung Sprachkommunikation
10/10/2018 - 10/12/2018 at Oldenburg, Deutschland

Proceedings: Speech Communication

Pages: 5Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Authors:
Li, Chia Yu; Vu, Ngoc Thang (Institute for Natural Language Processing (IMS), University of Stuttgart, Germany)

Abstract:
This paper presents our latest investigation on Densely Connected Convolutional Networks (DenseNets) for acoustic modelling (AM) in automatic speech recognition. DenseNets are very deep, compact convolutional neural networks, which have demonstrated incredible improvements over the state-of-the-art results on several data sets in computer vision. Our experimental results show that DenseNet can be used for AM significantly outperforming other neuralbased models such as DNNs, CNNs, VGGs. Furthermore, results onWall Street Journal revealed that with only a half of the training data DenseNet was able to outperform other models trained with the full data set by a large margin.