Efficient Parameter Transcoding Scheme for Interactive Spatial Audio Communication

Conference: Sprachkommunikation 2010 - 9. ITG-Fachtagung
10/06/2010 - 10/08/2010 at Bochum, Germany

Proceedings: Sprachkommunikation 2010

Pages: 4
Language: English
Type: PDF


Kallinger, Markus; Falch, Cornelia; Kuech, Fabian (Fraunhofer Institute for Integrated Circuits IIS, 91058 Erlangen, Germany)

Directional Audio Coding (DirAC) is a well-proven technique for recording spatial sound and efficiently coding it into one or very few audio channels accompanied by parametric side information. It is therefore well suited to teleconferencing with spatial rendering of distributed sources. In teleconferences with more than two attending parties, additional rendering capabilities are desirable in order to (a) spatially distribute the parties, which improves the intelligibility of individual speakers, especially when multiple sources are active at once, (b) adjust the levels of individual sources to given listening preferences, and (c) align acoustic with visual cues. MPEG Spatial Audio Object Coding (SAOC) provides this functionality. SAOC was originally designed to take single, separated audio objects, i.e., their signals, as inputs. In this contribution we propose DirAC in the acoustic front-end processing for SAOC, where directional filtering in DirAC’s parameter domain is used to separate single sources from an acoustic mixture. The paper takes a close look at an efficient transcoding between the parameters of the two techniques.
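
The idea of directional filtering in the parameter domain can be illustrated with a minimal sketch. It is not the authors' method: the abstract gives no formulas, so the sketch assumes that each time-frequency bin of the downmix carries an azimuth estimate and a diffuseness value between 0 and 1, and that a source is extracted by weighting each bin with a gain that depends on both. The raised-cosine window, the sqrt(1 - diffuseness) scaling, and all function and parameter names are assumptions made for illustration.

```python
import math


def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two azimuths, in degrees (0..180)."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)


def directional_gain(azimuth_deg, diffuseness, target_deg, width_deg=60.0):
    """Hypothetical gain for one time-frequency bin (illustrative assumption).

    A raised-cosine window centred on target_deg, zero outside
    +/- width_deg / 2, scaled by sqrt(1 - diffuseness) so that
    mostly diffuse bins are attenuated.
    """
    d = angular_distance(azimuth_deg, target_deg)
    half = width_deg / 2.0
    if d >= half:
        return 0.0
    window = 0.5 * (1.0 + math.cos(math.pi * d / half))
    return math.sqrt(max(0.0, 1.0 - diffuseness)) * window


def extract_source(downmix, azimuths_deg, diffuseness, target_deg, width_deg=60.0):
    """Weight each bin of the downmix by its directional gain.

    downmix, azimuths_deg and diffuseness are equally long sequences,
    one entry per time-frequency bin; downmix values may be complex.
    """
    return [
        x * directional_gain(az, psi, target_deg, width_deg)
        for x, az, psi in zip(downmix, azimuths_deg, diffuseness)
    ]
```

Calling extract_source once per desired direction would yield one object-like signal per talker. Under the stated assumptions, such signals are the kind of separated input that SAOC was originally designed for.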