On Iterative Exchange of Soft State Information in Two-Channel Automatic Speech Recognition

Conference: Sprachkommunikation - Beiträge zur 10. ITG-Fachtagung
09/26/2012 - 09/28/2012 at Braunschweig, Deutschland

Proceedings: Sprachkommunikation

Pages: 4Language: englishTyp: PDF

Personal VDE Members are entitled to a 10% discount on this title

Authors:
Scheler, David; Walz, Simon; Fingscheidt, Tim (Institute for Communications Technology, Technische Universität Braunschweig, 38106 Braunschweig, Germany)

Abstract:
The robustness of automatic speech recognition systems can be improved by exploiting further information sources such as additional acoustic channels or modalities. Since the arising problem of information fusion exhibits striking parallels to problems in digital communications, where the turbo principle [1] was a groundbreaking innovation, Shivappa et al. showed that a similar iterative scheme can be applied to multimodal speech recognition [2]. We provide new interpretations and propose significant modifications of their approach: First, we show that no modification of the forward-backward recognition algorithm is required; second, we dispense with their proposed heuristic model; third, we deliver our own interpretation and formulation of the extrinsic information passed between the recognizers. Our proposed method is successfully applied to a synthetic unimodal two-channel speech recognition task.