On Iterative Exchange of Soft State Information in Two-Channel Automatic Speech Recognition
Conference: Sprachkommunikation - Beiträge zur 10. ITG-Fachtagung
09/26/2012 - 09/28/2012 at Braunschweig, Deutschland
Pages: 4Language: englishTyp: PDFPersonal VDE Members are entitled to a 10% discount on this title
Scheler, David; Walz, Simon; Fingscheidt, Tim (Institute for Communications Technology, Technische Universität Braunschweig, 38106 Braunschweig, Germany)
The robustness of automatic speech recognition systems can be improved by exploiting further information sources such as additional acoustic channels or modalities. Since the arising problem of information fusion exhibits striking parallels to problems in digital communications, where the turbo principle  was a groundbreaking innovation, Shivappa et al. showed that a similar iterative scheme can be applied to multimodal speech recognition . We provide new interpretations and propose significant modifications of their approach: First, we show that no modification of the forward-backward recognition algorithm is required; second, we dispense with their proposed heuristic model; third, we deliver our own interpretation and formulation of the extrinsic information passed between the recognizers. Our proposed method is successfully applied to a synthetic unimodal two-channel speech recognition task.