Deep Learning for Multimodal Emotion Recognition-Attentive Residual Disconnected RNN

Erick Chandra, Jane Yung Jen Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Human communicates using verbal and non-verbal cues. One of the most essential elements that complements the understanding of communication is emotion. Emotion is expressed not only in words, but also facial expressions, body language, tone, etc. Therefore, we formulate the emotion recognition as a multimodal task.Emotions are usually described in a sequence along with the utterances. In recent years, RNN-based models have been known to be good at modeling the entire sequence and capturing long-term dependencies. However, it lacks the ability to extract local key patterns and position-invariant features. Hence, we adopt Deep Attentive Residual Disconnected RNN model which incorporates the concept from both RNN and CNN to enhance the ability to capture spatial and temporal features.We utilize CMU MOSEI dataset comprising of language, visual, and acoustic modalities for training and evaluating our model. The results show that Deep Attentive Residual Disconnected RNN model outperforms the baseline. Besides, the use of multimodal approach also solidifies the recognition better compared to those of single modalities.

Original languageEnglish
Title of host publicationProceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728146669
DOIs
StatePublished - 11 2019
Externally publishedYes
Event24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019 - Kaohsiung, Taiwan
Duration: 21 11 201923 11 2019

Publication series

NameProceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019

Conference

Conference24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
Country/TerritoryTaiwan
CityKaohsiung
Period21/11/1923/11/19

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

Keywords

  • Attention Mechanism
  • Disconnected Recurrent Neural Network
  • Emotion Recognition
  • Residual Network

Fingerprint

Dive into the research topics of 'Deep Learning for Multimodal Emotion Recognition-Attentive Residual Disconnected RNN'. Together they form a unique fingerprint.

Cite this