Fully Automatic Audiovisual Emotion Recognition: Voice,Words, and the Face

Konferenz: Sprachkommunikation - Beiträge zur 10. ITG-Fachtagung
26.09.2012 - 28.09.2012 in Braunschweig, Deutschland

Tagungsband: Sprachkommunikation

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Wöllmer, Martin; Kaiser, Moritz; Eyben, Florian; Weninger, Felix; Schuller, Björn; Rigoll, Gerhard (Institute for Human-Machine Communication, Technische Universität München, 80333 München, Germany)

The recognition of human emotions from spontaneous and non-prototypical real-life data is currently one of the most challenging tasks in the field of affective computing. This contribution presents our recent advances in assessing dimensional representations of emotion, such as arousal, expectation, power, and valence, in an audiovisual humancomputer interaction scenario. We propose a fully automatic multimodal recognition approach based on contextsensitive modeling of audio and video features. Evaluations on the Audiovisual Sub-Challenge of the 2011 Audio/ Visual Emotion Challenge show how accurately different affective dimensions can be recognized. Our experiments reveal that the proposed multimodal recognition system outperforms previously introduced techniques evaluated on the same task.