Brain-Machine Coupled Learning Method for Facial Emotion Recognition

Neural network models of machine learning have shown promising prospects for visual tasks, such as facial emotion recognition (FER). However, the generalization of the model trained from a dataset with a few samples is limited. Unlike the machine, the human brain can effectively realize the required information from a few samples to complete the visual tasks. To learn the generalization ability of the brain, in this article, we propose a novel brain-machine coupled learning method for facial emotion recognition to let the neural network learn the visual knowledge of the machine and cognitive knowledge of the brain simultaneously. The proposed method utilizes visual images and electroencephalogram (EEG) signals to couple training the models in the visual and cognitive domains. Each domain model consists of two types of interactive channels, common and private. Since the EEG signals can reflect brain activity, the cognitive process of the brain is decoded by a model following reverse engineering. Decoding the EEG signals induced by the facial emotion images, the common channel in the visual domain can approach the cognitive process in the cognitive domain. Moreover, the knowledge specific to each domain is found in each private channel using an adversarial strategy. After learning, without the participation of the EEG signals, only the concatenation of both channels in the visual domain is used to classify facial emotion images based on the visual knowledge of the machine and the cognitive knowledge learned from the brain. Experiments demonstrate that the proposed method can produce excellent performance on several public datasets. Further experiments show that the proposed method trained from the EEG signals has good generalization ability on new datasets and can be applied to other network models, illustrating the potential for practical applications.

[1]  Rencheng Song,et al.  EEG-Based Emotion Recognition via Neural Architecture Search , 2023, IEEE Transactions on Affective Computing.

[2]  Mehakpreet Singh,et al.  Emotion Detection with Facial Feature Recognition Using CNN & OpenCV , 2022, 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE).

[3]  J. Sengupta,et al.  Ensemble Machine Learning-Based Affective Computing for Emotion Recognition Using Dual-Decomposed EEG Signals , 2022, IEEE Sensors Journal.

[4]  Jaiteg Singh,et al.  A Deep Learning Approach for Real Time Facial Emotion Recognition , 2021, 2021 10th International Conference on System Modeling & Advancement in Research Trends (SMART).

[5]  Kidiyo Kpalma,et al.  Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition , 2021, ArXiv.

[6]  Sunghoon Im,et al.  DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Simone Palazzo,et al.  Visual Saliency Detection guided by Neural Signals , 2020, 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020).

[8]  Danilo Bzdok,et al.  Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets , 2020, Nature Communications.

[9]  Liang Lin,et al.  Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Yang Zou,et al.  Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification , 2020, ECCV.

[11]  Xiaofeng Liu,et al.  Image2Audio: Facilitating Semi-supervised Audio Emotion Recognition with Facial Expression Image , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[12]  Roger Zimmermann,et al.  MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis , 2020, ACM Multimedia.

[13]  Marius Leordeanu,et al.  Reading into the mind's eye: Boosting automatic visual recognition with EEG signals , 2020, Neurocomputing.

[14]  Padmavati Khandnor,et al.  A comparative analysis of signal processing and classification methods for different applications based on EEG signals , 2020 .

[15]  Kui Jia,et al.  Discriminative Adversarial Domain Adaptation , 2019, AAAI.

[16]  Yvonne Rogers,et al.  Being Human: Human-Computer Interaction in the Year 2020 , 2019 .

[17]  Muhammad Sajjad,et al.  Human Behavior Understanding in Big Multimedia Data Using CNN based Facial Expression Recognition , 2019, Mob. Networks Appl..

[18]  Soraia M. Alarcão,et al.  Emotions Recognition Using EEG Signals: A Survey , 2019, IEEE Transactions on Affective Computing.

[19]  Yu Qiao,et al.  Frame Attention Networks for Facial Expression Recognition in Videos , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[20]  Wanzeng Kong,et al.  Facial Emotion Recognition Based on Brain and Machine Collaborative Intelligence , 2019, 2019 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA).

[21]  S. Nishida,et al.  Brain-mediated Transfer Learning of Convolutional Neural Networks , 2019, AAAI.

[22]  Quoc V. Le,et al.  Searching for MobileNetV3 , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23]  Jianmin Jiang,et al.  A Context-Supported Deep Learning Framework for Multimodal Brain Imaging Classification , 2019, IEEE Transactions on Human-Machine Systems.

[24]  Yang Yang,et al.  Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network , 2019, Neurocomputing.

[25]  Shervin Minaee,et al.  Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network , 2019, Sensors.

[26]  Mubarak Shah,et al.  Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  M. Sakaki,et al.  Effects of emotion on cognitive processing , 2018, The Proceedings of the Annual Convention of the Japanese Psychological Association.

[28]  Ling Shao,et al.  Deep Multi-task Learning to Recognise Subtle Facial Expressions of Mental States , 2018, ECCV.

[29]  Cheng Wang,et al.  LRMM: Learning to Recommend with Missing Modalities , 2018, EMNLP.

[30]  Andrew Zisserman,et al.  Emotion Recognition in Speech using Cross-Modal Transfer in the Wild , 2018, ACM Multimedia.

[31]  Lin Chen,et al.  Visual Recognition in RGB Images and Videos by Learning from RGB-D Data , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Bo Chen,et al.  MnasNet: Platform-Aware Neural Architecture Search for Mobile , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Xiangyu Zhang,et al.  ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design , 2018, ECCV.

[34]  Shan Li,et al.  Deep Facial Expression Recognition: A Survey , 2018, IEEE Transactions on Affective Computing.

[35]  ByoungChul Ko,et al.  A Brief Review of Facial Emotion Recognition Based on Visual Information , 2018, Sensors.

[36]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37]  Mubarak Shah,et al.  Brain2Image: Converting Brain Signals into Images , 2017, ACM Multimedia.

[38]  Zhiyuan Li,et al.  Island Loss for Learning Discriminative Features in Facial Expression Recognition , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[39]  Mubarak Shah,et al.  Generative Adversarial Networks Conditioned by Brain Signals , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[40]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.

[41]  Jian Shen,et al.  Wasserstein Distance Guided Representation Learning for Domain Adaptation , 2017, AAAI.

[42]  Jiayu Zhou,et al.  Missing Modalities Imputation via Cascaded Residual Autoencoder , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Yong Du,et al.  Facial Expression Recognition Based on Deep Evolutional Spatial-Temporal Networks , 2017, IEEE Transactions on Image Processing.

[44]  Walter J. Scheirer,et al.  Using human brain activity to guide machine learning , 2017, Scientific Reports.

[45]  Trevor Darrell,et al.  Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Edwin Lughofer,et al.  Central Moment Discrepancy (CMD) for Domain-Invariant Representation Learning , 2017, ICLR.

[47]  Martin Kampel,et al.  Facial Expression Recognition using Convolutional Neural Networks: State of the Art , 2016, ArXiv.

[48]  Brent Lance,et al.  EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces , 2016, Journal of neural engineering.

[49]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  S. Palazzo,et al.  Deep Learning Human Mind for Automated Visual Classification , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  George Trigeorgis,et al.  Domain Separation Networks , 2016, NIPS.

[52]  Nikos Komodakis,et al.  Wide Residual Networks , 2016, BMVC.

[53]  Gaurav Sharma,et al.  LOMo: Latent Ordinal Model for Facial Analysis in Videos , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[55]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Junmo Kim,et al.  Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[57]  Ross B. Girshick,et al.  Reducing Overfitting in Deep Networks by Decorrelating Representations , 2015, ICLR.

[58]  Aurobinda Routray,et al.  Automatic facial expression recognition using features of salient facial patches , 2015, IEEE Transactions on Affective Computing.

[59]  Jun Li,et al.  Robust Representation and Recognition of Facial Emotions Using Extreme Sparse Learning , 2015, IEEE Transactions on Image Processing.

[60]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61]  R. K. Kulkarni,et al.  Face detection and facial expression recognition system , 2014, 2014 International Conference on Electronics and Communication Systems (ICECS).

[62]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[63]  Yong Peng,et al.  EEG-based emotion classification using deep belief networks , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[64]  Martin Buss,et al.  Feature Extraction and Selection for Emotion Recognition from EEG , 2014, IEEE Transactions on Affective Computing.

[65]  Forrest N. Iandola,et al.  DenseNet: Implementing Efficient ConvNet Descriptor Pyramids , 2014, ArXiv.

[66]  Christoph H. Lampert,et al.  Learning to Rank Using Privileged Information , 2013, 2013 IEEE International Conference on Computer Vision.

[67]  João Magalhães,et al.  Competitive affective gaming: winning with a smile , 2013, ACM Multimedia.

[68]  Mohammad Soleymani,et al.  Multimedia implicit tagging using EEG signals , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[69]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[70]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[71]  I. Gorodnitsky,et al.  EEG mu component responses to viewing emotional faces , 2012, Behavioural Brain Research.

[72]  Mohammad Rahmati,et al.  Driver drowsiness detection using face expression recognition , 2011, 2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA).

[73]  Alexei A. Efros,et al.  Unbiased look at dataset bias , 2011, CVPR 2011.

[74]  Kwang-Eun Ko,et al.  Development of a Facial Emotion Recognition Method Based on Combining AAM with DBN , 2010, 2010 International Conference on Cyberworlds.

[75]  Harry W. Agius,et al.  ELVIS: Entertainment-led video summaries , 2010, ACM Trans. Multim. Comput. Commun. Appl..

[76]  Takeo Kanade,et al.  The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[77]  Leontios J. Hadjileontiadis,et al.  Emotion Recognition From EEG Using Higher Order Crossings , 2010, IEEE Transactions on Information Technology in Biomedicine.

[78]  Mohammad Soleymani,et al.  A Bayesian framework for video affective representation , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[79]  Vladimir Vapnik,et al.  A new learning paradigm: Learning using privileged information , 2009, Neural Networks.

[80]  Johan Wagemans,et al.  Perceived Shape Similarity among Unfamiliar Objects and the Organization of the Human Object Vision Pathway , 2008, The Journal of Neuroscience.

[81]  J.-M. Sun,et al.  Facial emotion recognition in modern distant education system using SVM , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[82]  Léon J. M. Rothkrantz,et al.  Emotion recognition using brain activity , 2008, CompSysTech.

[83]  Wanqing Li,et al.  A Real-Time Facial Expression Recognition System for Online Games , 2008, Int. J. Comput. Games Technol..

[84]  P. Downing,et al.  The neural basis of visual body perception , 2007, Nature Reviews Neuroscience.

[85]  Christoph Bartneck,et al.  HCI and the Face: Towards an Art of the Soluble , 2007, HCI.

[86]  Bogdan Raducanu,et al.  Efficient Facial Expression Recognition for Human Robot Interaction , 2007, IWANN.

[87]  John R. Hershey,et al.  Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[88]  Miguel A. L. Nicolelis,et al.  Brain–machine interfaces: past, present and future , 2006, Trends in Neurosciences.

[89]  G. Rizzolatti,et al.  The mirror-neuron system. , 2004, Annual review of neuroscience.

[90]  N. Kanwisher,et al.  Cortical Regions Involved in Perceiving Object Shape , 2000, The Journal of Neuroscience.

[91]  Michael J. Lyons,et al.  Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[92]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[93]  Ettore Lettich,et al.  Ten Percent Electrode System for Topographic Studies of Spontaneous and Evoked EEG Activities , 1985 .

[94]  Andrzej Cichocki,et al.  CTFN: Hierarchical Learning for Multimodal Sentiment Analysis Using Coupled-Translation Fusion Network , 2021, ACL.

[95]  Houtan Jebelli,et al.  EEG Signal-Processing Framework to Obtain High-Quality Brain Waves from an Off-the-Shelf Wearable EEG Device , 2018, J. Comput. Civ. Eng..

[96]  Kannappan Palaniappan,et al.  Deep learning-based facial expression recognition for monitoring neurological disorders , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[97]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[98]  Mohammad Soleymani,et al.  Analysis of EEG Signals and Facial Expressions for Continuous Emotion Detection , 2016, IEEE Transactions on Affective Computing.

[99]  Rauf Izmailov,et al.  Learning using privileged information: similarity control and knowledge transfer , 2015, J. Mach. Learn. Res..

[100]  Liu Guangyuan,et al.  Application of EEG Signal in Emotion Recognition , 2010 .

[101]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[102]  A. Krizhevsky ImageNet Classification with Deep Convolutional Neural Networks , 2022 .