Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis

Multimodal signals are more powerful than unimodal data for emotion recognition since they can represent emotions more comprehensively. In this paper, we introduce deep canonical correlation analysis (DCCA) to multimodal emotion recognition. The basic idea behind DCCA is to transform each modality separately and coordinate different modalities into a hyperspace by using specified canonical correlation analysis constraints. We evaluate the performance of DCCA on five multimodal datasets: the SEED, SEED-IV, SEED-V, DEAP, and DREAMER datasets. Our experimental results demonstrate that DCCA achieves state-of-the-art recognition accuracy rates on all five datasets: 94.58% on the SEED dataset, 87.45% on the SEED-IV dataset, 84.33% and 85.62% for two binary classification tasks and 88.51% for a four-category classification task on the DEAP dataset, 83.08% on the SEED-V dataset, and 88.99%, 90.57%, and 90.67% for three binary classification tasks on the DREAMER dataset. We also compare the noise robustness of DCCA with that of existing methods when adding various amounts of noise to the SEED-V dataset. The experimental results indicate that DCCA has greater robustness. By visualizing feature distributions with t-SNE and calculating the mutual information between different modalities before and after using DCCA, we find that the features transformed by DCCA from different modalities are more homogeneous and discriminative across emotions.

[1]  Michel Grabisch,et al.  Application of the Choquet integral in multicriteria decision making , 2000 .

[2]  Clayton D. Scott,et al.  Robust kernel density estimation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Wei Liu,et al.  Multimodal Emotion Recognition Using Deep Neural Networks , 2017, ICONIP.

[4]  Xiang Li,et al.  Emotion recognition from multi-channel EEG data through Convolutional Recurrent Neural Network , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[5]  Mohammad Soleymani,et al.  Analysis of EEG Signals and Facial Expressions for Continuous Emotion Detection , 2016, IEEE Transactions on Affective Computing.

[6]  Bao-Liang Lu,et al.  Identifying Stable Patterns over Time for Emotion Recognition from EEG , 2016, IEEE Transactions on Affective Computing.

[7]  Jennifer Healey,et al.  Toward Machine Emotional Intelligence: Analysis of Affective Physiological State , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Yuan-Pin Lin,et al.  Generalizations of the subject-independent feature set for music-induced emotion recognition , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[9]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[10]  Mark E Josephson,et al.  Frequency content and characteristics of ventricular conduction. , 2015, Journal of electrocardiology.

[11]  Fadel Adib,et al.  Emotion recognition using wireless signals , 2016, MobiCom.

[12]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[13]  Ying Chen,et al.  Combining feature-level and decision-level fusion in a hierarchical classifier for emotion recognition in the wild , 2015, Journal on Multimodal User Interfaces.

[14]  Bao-Liang Lu,et al.  Differential entropy feature for EEG-based vigilance estimation , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[15]  Wei Zhang,et al.  Cross-Subject EEG Feature Selection for Emotion Recognition Using Transfer Recursive Feature Elimination , 2017, Front. Neurorobot..

[16]  Rifai Chai,et al.  A Hybrid Fuzzy Cognitive Map/Support Vector Machine Approach for EEG-Based Emotion Classification Using Compressed Sensing , 2018, Int. J. Fuzzy Syst..

[17]  Bao-Liang Lu,et al.  Differential entropy feature for EEG-based emotion classification , 2013, 2013 6th International IEEE/EMBS Conference on Neural Engineering (NER).

[18]  Wei Liu,et al.  Emotion Recognition Using Multimodal Deep Learning , 2016, ICONIP.

[19]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[20]  Rafael A. Calvo,et al.  Classification of affects using head movement, skin color features and physiological signals , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[21]  Roger Zimmermann,et al.  Self-Attentive Feature-Level Fusion for Multimodal Emotion Detection , 2018, 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[22]  ByoungChul Ko,et al.  A Brief Review of Facial Emotion Recognition Based on Visual Information , 2018, Sensors.

[23]  Jiebo Luo,et al.  Unsupervised Alignment of Natural Language Instructions with Video Segments , 2014, AAAI.

[24]  Thierry Pun,et al.  DEAP: A Database for Emotion Analysis ;Using Physiological Signals , 2012, IEEE Transactions on Affective Computing.

[25]  Thierry Pun,et al.  Multimodal Emotion Recognition in Response to Videos , 2012, IEEE Transactions on Affective Computing.

[26]  Erik Cambria,et al.  A review of affective computing: From unimodal analysis to multimodal fusion , 2017, Inf. Fusion.

[27]  Q. M. Jonathan Wu,et al.  EEG-Based Emotion Recognition Using Hierarchical Network With Subnetwork Nodes , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[28]  Zhong Yin,et al.  Recognition of emotions using multimodal physiological signals and an ensemble deep learning model , 2017, Comput. Methods Programs Biomed..

[29]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[30]  Sidney K. D'Mello,et al.  A Review and Meta-Analysis of Multimodal Affect Detection Systems , 2015, ACM Comput. Surv..

[31]  A. C. Young,et al.  Frequency Analysis of the Electrocardiogram , 1960, Circulation research.

[32]  Colin Fyfe,et al.  Kernel and Nonlinear Canonical Correlation Analysis , 2000, IJCNN.

[33]  Mohammad H. Mahoor,et al.  AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild , 2017, IEEE Transactions on Affective Computing.

[34]  Tae-Kyun Kim,et al.  Tensor Canonical Correlation Analysis for Action Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Yue Wang,et al.  A three-stage decision framework for multi-subject emotion recognition using physiological signals , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[36]  Michio Sugeno,et al.  A study on subjective evaluations of printed color images , 1991, Int. J. Approx. Reason..

[37]  Samuel Kaski,et al.  Bayesian Canonical correlation analysis , 2013, J. Mach. Learn. Res..

[38]  Bao-Liang Lu,et al.  Off-line and on-line vigilance estimation based on linear dynamical system and manifold learning , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[39]  John Shawe-Taylor,et al.  Sparse canonical correlation analysis , 2009, Machine Learning.

[40]  Bao-Liang Lu,et al.  Investigating Critical Frequency Bands and Channels for EEG-Based Emotion Recognition with Deep Neural Networks , 2015, IEEE Transactions on Autonomous Mental Development.

[41]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[42]  Bao-Liang Lu,et al.  Emotional state classification from EEG data using machine learning approach , 2014, Neurocomputing.

[43]  Bao-Liang Lu,et al.  Classification of Five Emotions from EEG and Eye Movement Signals: Discrimination Ability and Stability over Time , 2019, 2019 9th International IEEE/EMBS Conference on Neural Engineering (NER).

[44]  Bing Li,et al.  Gender classification by combining clothing, hair and facial component classifiers , 2012, Neurocomputing.

[45]  Samuel Kaski,et al.  Probabilistic approach to detecting dependencies between data sets , 2008, Neurocomputing.

[46]  Nikhil Rasiwasia,et al.  Cluster Canonical Correlation Analysis , 2014, AISTATS.

[47]  Fakhri Karray,et al.  Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit..

[48]  Mohd Yusoff Mashor,et al.  ECG signals classification based on discrete wavelet transform, time domain and frequency domain features , 2015, 2015 2nd International Conference on Biomedical Engineering (ICoBE).

[49]  Yifei Lu,et al.  Combining Eye Movements and EEG to Enhance Emotion Recognition , 2015, IJCAI.

[50]  Wenming Zheng,et al.  EEG Emotion Recognition Using Dynamical Graph Convolutional Neural Networks , 2020, IEEE Transactions on Affective Computing.

[51]  A. Jacobs,et al.  The coupling of emotion and cognition in the eye: introducing the pupil old/new effect. , 2007, Psychophysiology.

[52]  Andrzej Cichocki,et al.  EmotionMeter: A Multimodal Framework for Recognizing Human Emotions , 2019, IEEE Transactions on Cybernetics.

[53]  Christian Jutten,et al.  Multimodal Data Fusion: An Overview of Methods, Challenges, and Prospects , 2015, Proceedings of the IEEE.

[54]  Aaron C. Courville,et al.  MINE: Mutual Information Neural Estimation , 2018, ArXiv.

[55]  Rosalind W. Picard Affective Computing , 1997 .

[56]  Marc'Aurelio Ranzato,et al.  DeViSE: A Deep Visual-Semantic Embedding Model , 2013, NIPS.

[57]  Rui Li,et al.  Classification of Five Emotions from EEG and Eye Movement Signals: Complementary Representation Properties , 2019, 2019 9th International IEEE/EMBS Conference on Neural Engineering (NER).

[58]  Wei Liu,et al.  Multi-view Emotion Recognition Using Deep Canonical Correlation Analysis , 2018, ICONIP.

[59]  Louis-Philippe Morency,et al.  Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Elisabeth André,et al.  Emotion recognition based on physiological changes in music listening , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[61]  Yu-Liang Hsu,et al.  Automatic ECG-Based Emotion Recognition in Music Listening , 2020, IEEE Transactions on Affective Computing.

[62]  Osmar R. Zaïane,et al.  Current State of Text Sentiment Analysis from Opinion to Emotion Mining , 2017, ACM Comput. Surv..

[63]  Naeem Ramzan,et al.  DREAMER: A Database for Emotion Recognition Through EEG and ECG Signals From Wireless Low-cost Off-the-Shelf Devices , 2018, IEEE Journal of Biomedical and Health Informatics.