Abstract
Human communicates using verbal and non-verbal cues. One of the most essential elements that complements the understanding of communication is emotion. Emotion is expressed not only in words, but also facial expressions, body language, tone, etc. Therefore, we formulate the emotion recognition as a multimodal task.Emotions are usually described in a sequence along with the utterances. In recent years, RNN-based models have been known to be good at modeling the entire sequence and capturing long-term dependencies. However, it lacks the ability to extract local key patterns and position-invariant features. Hence, we adopt Deep Attentive Residual Disconnected RNN model which incorporates the concept from both RNN and CNN to enhance the ability to capture spatial and temporal features.We utilize CMU MOSEI dataset comprising of language, visual, and acoustic modalities for training and evaluating our model. The results show that Deep Attentive Residual Disconnected RNN model outperforms the baseline. Besides, the use of multimodal approach also solidifies the recognition better compared to those of single modalities.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781728146669 |
| DOIs | |
| State | Published - 11 2019 |
| Externally published | Yes |
| Event | 24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019 - Kaohsiung, Taiwan Duration: 21 11 2019 → 23 11 2019 |
Publication series
| Name | Proceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019 |
|---|
Conference
| Conference | 24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019 |
|---|---|
| Country/Territory | Taiwan |
| City | Kaohsiung |
| Period | 21/11/19 → 23/11/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- Attention Mechanism
- Disconnected Recurrent Neural Network
- Emotion Recognition
- Residual Network