Efficient Parameter Transcoding Scheme for Interactive Spatial Audio Communication

Conference: Sprachkommunikation 2010 - 9. ITG-Fachtagung
06.10.2010 - 08.10.2010 in Bochum, Germany

Proceedings: Sprachkommunikation 2010

Pages: 4
Language: English
Type: PDF

Kallinger, Markus; Falch, Cornelia; Kuech, Fabian (Fraunhofer Institute for Integrated Circuits IIS, 91058 Erlangen, Germany)

Directional Audio Coding (DirAC) is a well-proven technique for recording spatial sound and coding it efficiently into one or very few audio channels accompanied by parametric side information. It is therefore suited to teleconferencing with spatial rendering of distributed sources. In teleconferences with more than two attending parties, additional rendering capabilities are desirable to (a) spatially distribute the parties so that individual speakers are more intelligible, especially when several sources are active at once, (b) adjust the levels of individual sources to given listening preferences, and (c) align acoustic with visual cues. MPEG Spatial Audio Object Coding (SAOC) provides this functionality. SAOC was originally designed to take single, separated audio objects, i.e., their signals, as inputs. In this contribution we propose DirAC as acoustic front-end processing for SAOC, where directional filtering in DirAC’s parameter domain is used to separate single sources from an acoustic mixture. The paper takes a close look at an efficient transcoding between the parameters of the two techniques.
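
The idea of directional filtering in a parameter domain can be illustrated with a minimal sketch. It is not the transcoding scheme of the paper, whose details are not given in the abstract. The sketch assumes that each time-frequency tile of a mono downmix carries a direction-of-arrival azimuth and a diffuseness value between 0 and 1, and that a source is extracted by weighting each tile with a raised-cosine window around a target direction and with sqrt(1 - diffuseness) as a direct-sound weight. All function names, the window shape, the 60 degree width, and the per-tile power normalization are hypothetical choices made for illustration.

```python
import math

def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two azimuth angles, in degrees."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)

def directional_gain(doa_deg, target_deg, width_deg):
    """Raised-cosine window: 1 at the target, 0 at or beyond width_deg / 2 away."""
    d = angular_distance(doa_deg, target_deg)
    half = width_deg / 2.0
    if d >= half:
        return 0.0
    return 0.5 * (1.0 + math.cos(math.pi * d / half))

def filter_tiles(downmix, doa_deg, diffuseness, target_deg, width_deg=60.0):
    """Weight each tile by its directional gain and an assumed direct-sound factor."""
    out = []
    for x, doa, psi in zip(downmix, doa_deg, diffuseness):
        direct = math.sqrt(max(0.0, 1.0 - psi))  # assumption: direct-sound weight
        out.append(directional_gain(doa, target_deg, width_deg) * direct * x)
    return out

def relative_powers(objects):
    """Per-tile power of each object, normalized by the strongest object in that tile.

    Illustrative only; not the exact parameter definition of any standard.
    """
    n_tiles = len(objects[0])
    result = [[0.0] * n_tiles for _ in objects]
    for k in range(n_tiles):
        powers = [abs(obj[k]) ** 2 for obj in objects]
        peak = max(powers)
        for i, p in enumerate(powers):
            result[i][k] = p / peak if peak > 0.0 else 0.0
    return result

# Toy data: three tiles, two talkers at 0 and 90 degrees azimuth.
downmix = [1.0, 0.5, 0.8]
doa = [0.0, 90.0, 10.0]
diffuseness = [0.0, 0.0, 0.75]

talker_a = filter_tiles(downmix, doa, diffuseness, target_deg=0.0)
talker_b = filter_tiles(downmix, doa, diffuseness, target_deg=90.0)
levels = relative_powers([talker_a, talker_b])
```

In this toy example each tile is assigned to the talker whose window contains the tile's direction, and the normalized powers give a per-tile description of which separated object dominates. A practical system would derive object parameters directly from the directional parameters without resynthesizing signals; how that is done efficiently is the subject of the paper itself.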