Introducing Block-Wise Processing into Turbo Viterbi ASR

Konferenz: Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
05.10.2016 - 07.10.2016 in Paderborn, Deutschland

Tagungsband: ITG-Fb. 267: Speech Communication

Seiten: 5Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Receveur, Simon; Lohrenz, Timo; Fingscheidt, Tim (Institute for Communications Technology, Technische Universität Braunschweig, 38106 Braunschweig, Germany)

Recently, turbo automatic speech recognition (ASR) has been proposed as intermediate-level fusion approach for multi-channel, or multi-model, or multi-modal ASR. To apply turbo ASR in a continuous recognition application, the turbo Viterbi ASR has to be used. Working towards turbo ASR for real-time implementations in a practical context, the contributions of this paper are the following: After briefly reviewing the turbo Viterbi ASR approach, we extend it by introducing a new block-wise processing of the iterations, and a new simplified weighting scheme for both the likelihoods and the so-called extrinsic information. Applied to a single channel unimodal ASR task with fusion of two types of features, we analyze and discuss the proposed extensions with respect to latency and required complexity.