Hybrid spatiotemporal models for sentiment classification via galvanic skin response

Abstract Sentiment plays an important role in cognition, creativity, attention, and decision making in people’s daily lives. Researchers have made great progress in sentiment recognition through images and speech. In this paper, a multimodal dataset is proposed for sentiment classification (MDSTC 1 )—a multimodal dataset collected with multimodal channels by customized physiological sensors and manually labelled for human sentiment analysis. MDSTC contains galvanic skin response (GSR), pulse, speech, and facial expression data of 100 volunteers labelled with timeline, emotional self-ratings, and personal information such as age, sex and Big Five personality scores, which were collected while watching video clips to stimulate emotions. Using the MDSTC dataset, the GSR channel with six types of emotion labels is used to perform human sentiment analysis in this paper. The GSR signal is converted into a spectrogram to adopt image-related methods and deep learning models. A convolutional neural network, long short-term memory, and self-attention mechanism are adopted and combined, and spatiotemporal hybrid models are proposed to perform human sentiment analysis. Comparisons are performed between the proposed model and state-of-the-art models in related works. The experimental results show that with the proposed spatiotemporal hybrid models, better results are obtained with respect to precision, recall, and F1-score. The experimental results also show that with the proposed spatiotemporal hybrid models working with GSR, people’s emotional changes can be obtained in real time with high precision.

[1]  Alex Graves,et al.  Long Short-Term Memory , 2020, Computer Vision.

[2]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[3]  I. Nyklíček,et al.  Cardiorespiratory differentiation of musically-induced emotions. , 1997 .

[4]  Lai Xiangwei Applications of simulated annealing-immune particle swarm optimization in emotion recognition of galvanic skin response signal , 2011 .

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Atefeh Goshvarpour,et al.  An accurate emotion recognition system using ECG and GSR signals and matching pursuit method , 2017, Biomedical journal.

[7]  Justin Salamon,et al.  Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification , 2016, IEEE Signal Processing Letters.

[8]  Jürgen Schmidhuber,et al.  LSTM can Solve Hard Long Time Lag Problems , 1996, NIPS.

[9]  Yusuf Yaslan,et al.  Emotion recognition via random forest and galvanic skin response: Comparison of time based feature sets, window sizes and wavelet approaches , 2016, 2016 Medical Technologies National Congress (TIPTEKNO).

[10]  Tieniu Tan,et al.  Affective Computing: A Review , 2005, ACII.

[11]  Mohammad Soleymani,et al.  A Multimodal Database for Affect Recognition and Implicit Tagging , 2012, IEEE Transactions on Affective Computing.

[12]  Elisabeth André,et al.  Emotion recognition based on physiological changes in music listening , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Yi Yang,et al.  Discovering Discriminative Graphlets for Aerial Image Categories Recognition , 2013, IEEE Transactions on Image Processing.

[14]  Xuelong Li,et al.  Image Categorization by Learning a Propagated Graphlet Path , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Alex Pentland,et al.  Coding, Analysis, Interpretation, and Recognition of Facial Expressions , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Min Wu,et al.  Speech emotion recognition based on an improved brain emotion learning model , 2018, Neurocomputing.

[18]  Kostas Karpouzis,et al.  The HUMAINE Database: Addressing the Collection and Annotation of Naturalistic and Induced Emotional Data , 2007, ACII.

[19]  Brent Lance,et al.  Optimal Arousal Identification and Classification for Affective Computing Using Physiological Signals: Virtual Reality Stroop Task , 2010, IEEE Transactions on Affective Computing.

[20]  Xiao Sun,et al.  Detecting influenza states based on hybrid model with personal emotional factors from social networks , 2016, Neurocomputing.

[21]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[22]  张平,et al.  Experiment study of autonomic nervous response patterns in five basic emotions , 2005 .

[23]  Dietmar F. Rösner,et al.  Inducing Genuine Emotions in Simulated Speech-Based Human-Machine Interaction: The NIMITEK Corpus , 2010, IEEE Transactions on Affective Computing.

[24]  Mingyang Liu,et al.  Human Emotion Recognition Based on Galvanic Skin Response Signal Feature Selection and SVM , 2016, 2016 International Conference on Smart City and Systems Engineering (ICSCSE).

[25]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[26]  Björn W. Schuller,et al.  Hidden Markov model-based speech emotion recognition , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[27]  Christine L. Lisetti,et al.  Using Noninvasive Wearable Computers to Recognize Human Emotions from Physiological Signals , 2004, EURASIP J. Adv. Signal Process..

[28]  Hermann Ney,et al.  From Feedforward to Recurrent LSTM Neural Networks for Language Modeling , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[29]  Eloy Irigoyen,et al.  An enhanced fuzzy algorithm based on advanced signal processing for identification of stress , 2018, Neurocomputing.

[30]  H. Becher,et al.  Absolute Risk and Loss-of-Lifetime Estimates for Quantitative Risk Assessment , 1998 .

[31]  A. Damasio,et al.  Basic emotions are associated with distinct patterns of cardiorespiratory activity. , 2006, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[32]  Jennifer Healey,et al.  Toward Machine Emotional Intelligence: Analysis of Affective Physiological State , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[34]  Xiao Sun,et al.  Emotional Human-Machine Conversation Generation Based on Long Short-Term Memory , 2017, Cognitive Computation.

[35]  Wootaek Lim,et al.  Speech emotion recognition using convolutional and Recurrent Neural Networks , 2016, 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[36]  Chao Li,et al.  Analysis of physiological for emotion recognition with the IRS model , 2016, Neurocomputing.

[37]  Xiao Wei Sun,et al.  A novel approach to generate a large scale of supervised data for short text sentiment analysis , 2018, Multimedia Tools and Applications.

[38]  Gunnar Rätsch,et al.  Input space versus feature space in kernel-based methods , 1999, IEEE Trans. Neural Networks.

[39]  Qiang Ji,et al.  Video Affective Content Analysis: A Survey of State-of-the-Art Methods , 2015, IEEE Transactions on Affective Computing.

[40]  A. Khasnobish,et al.  Emotion recognition employing ECG and GSR signals as markers of ANS , 2016, 2016 Conference on Advances in Signal Processing (CASP).

[41]  A. Angrilli,et al.  Cardiac responses associated with affective processing of unpleasant film stimuli. , 2000, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[42]  Xiao Sun,et al.  Sentiment analysis for Chinese microblog based on deep neural networks with convolutional extension features , 2016, Neurocomputing.

[43]  Leontios J. Hadjileontiadis,et al.  Emotion Recognition from Brain Signals Using Hybrid Adaptive Filtering and Higher Order Crossings Analysis , 2010, IEEE Transactions on Affective Computing.

[44]  Reza Derakhshani,et al.  An Ensemble Method for Classifying Startle Eyeblink Modulation from High-Speed Video Records , 2011, IEEE Transactions on Affective Computing.

[45]  Xuelong Li,et al.  Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook , 2016, IEEE Transactions on Cybernetics.

[46]  Carolyn Korsmeyer Rosalind W. Picard, Affective Computing , 2004, Minds and Machines.

[47]  Michael Biehl,et al.  Insightful stress detection from physiology modalities using Learning Vector Quantization , 2015, Neurocomputing.