Two-Stream Attention Network for Pain Recognition from Video Sequences

Several approaches have been proposed for the analysis of pain-related facial expressions. These approaches range from common classification architectures based on a set of carefully designed handcrafted features, to deep neural networks characterised by an autonomous extraction of relevant facial descriptors and simultaneous optimisation of a classification architecture. In the current work, an end-to-end approach based on attention networks for the analysis and recognition of pain-related facial expressions is proposed. The method combines both spatial and temporal aspects of facial expressions through a weighted aggregation of attention-based neural networks’ outputs, based on sequences of Motion History Images (MHIs) and Optical Flow Images (OFIs). Each input stream is fed into a specific attention network consisting of a Convolutional Neural Network (CNN) coupled to a Bidirectional Long Short-Term Memory (BiLSTM) Recurrent Neural Network (RNN). An attention mechanism generates a single weighted representation of each input stream (MHI sequence and OFI sequence), which is subsequently used to perform specific classification tasks. Simultaneously, a weighted aggregation of the classification scores specific to each input stream is performed to generate a final classification output. The assessment conducted on both the BioVid Heat Pain Database (Part A) and SenseEmotion Database points at the relevance of the proposed approach, as its classification performance is on par with state-of-the-art classification approaches proposed in the literature.

[1]  Patrick Thiam,et al.  Methods for Person-Centered Continuous Pain Intensity Assessment From Bio-Physiological Channels , 2016, IEEE Journal of Selected Topics in Signal Processing.

[2]  Patrick Thiam,et al.  Audio-Visual Recognition of Pain Intensity , 2016, MPRSS.

[3]  Yann LeCun,et al.  Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[4]  Markus Kächele,et al.  Data fusion for automated pain recognition , 2015, 2015 9th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth).

[5]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[6]  Robert T. Collins,et al.  Moving Object Localization in Thermal Imagery by Forward-backward MHI , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[7]  Patrick Thiam,et al.  Using a Quartile-based Data Transformation for Pain Intensity Classification based on the SenseEmotion Database , 2019, 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW).

[8]  Ayoub Al-Hamadi,et al.  The biovid heat pain database data for the advancement and systematic validation of an automated pain recognition system , 2013, 2013 IEEE International Conference on Cybernetics (CYBCO).

[9]  Gregory D. Hager,et al.  Regularizing face verification nets for pain intensity regression , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[10]  Patrick Thiam,et al.  Exploring Deep Physiological Models for Nociceptive Pain Recognition , 2019, bioRxiv.

[11]  Ferda Nur Alpaslan,et al.  Optical flow-based representation for video action detection , 2015, CVPR 2015.

[12]  Yu Qiao,et al.  Frame Attention Networks for Facial Expression Recognition in Videos , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[13]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[14]  Nadeem Ahmad Khan,et al.  Pain Intensity Evaluation through Facial Action Units , 2014, 2014 22nd International Conference on Pattern Recognition.

[15]  Sridha Sridharan,et al.  Automatically detecting pain using facial actions , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[16]  Jianda Han,et al.  Physiological Signal-Based Method for Measurement of Pain Intensity , 2017, Front. Neurosci..

[17]  Maja Pantic,et al.  The automatic detection of chronic pain-related expression : requirements , challenges and a multimodal dataset , 2014 .

[18]  K. Prkachin,et al.  The structure, reliability and validity of pain expression: Evidence from patients with shoulder pain , 2008, PAIN.

[19]  Esa Rahtu,et al.  BSIF: Binarized statistical image features , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[20]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[21]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Shiliang Sun,et al.  A survey of multi-view machine learning , 2013, Neural Computing and Applications.

[23]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[24]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[25]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[26]  Patrick Thiam,et al.  Combining Deep and Hand-Crafted Features for Audio-Based Pain Intensity Classification , 2018, MPRSS.

[27]  Patrick Thiam,et al.  Hierarchical Combination of Video Features for Personalised Pain Level Recognition , 2017, ESANN.

[28]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[29]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[30]  Shigeo Abe Support Vector Machines for Pattern Classification , 2010, Advances in Pattern Recognition.

[31]  Patrick Thiam,et al.  The SenseEmotion Database: A Multimodal Database for the Development and Systematic Validation of an Automatic Pain- and Emotion-Recognition System , 2016, MPRSS.

[32]  Kamal Nasrollahi,et al.  Deep Pain: Exploiting Long Short-Term Memory Networks for Facial Expression Classification , 2017, IEEE Transactions on Cybernetics.

[33]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  A. Akobeng,et al.  THE STRUCTURE, RELIABILITY AND VALIDITY OF THE IMPACT QUESTIONNAIRE TO MEASURE QUALITY OF LIFE IN CHILDREN WITH INFLAMMATORY BOWEL DISEASE: PG2-19 , 2005 .

[35]  Niko Brümmer,et al.  Application-independent evaluation of speaker detection , 2006, Comput. Speech Lang..

[36]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[37]  Gustavo Moreira da Silva,et al.  Automatic pain quantification using autonomic parameters , 2014 .

[38]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[39]  Thomas B. Moeslund,et al.  Spatio-temporal Pain Recognition in CNN-Based Super-Resolved Facial Images , 2016, VAAM/FFER@ICPR.

[40]  Steven S. Beauchemin,et al.  The computation of optical flow , 1995, CSUR.

[41]  Jeffrey Soar,et al.  A Joint Deep Neural Network Model for Pain Recognition from Face , 2019, 2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS).

[42]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[43]  Patrick Thiam,et al.  Multi-Modal Pain Intensity Recognition Based on the SenseEmotion Database , 2019, IEEE Transactions on Affective Computing.

[44]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[45]  Maja Pantic,et al.  The Automatic Detection of Chronic Pain-Related Expression: Requirements, Challenges and the Multimodal EmoPain Dataset , 2016, IEEE Transactions on Affective Computing.

[46]  Patrick Thiam,et al.  Multimodal Data Fusion for Person-Independent, Continuous Estimation of Pain Intensity , 2015, EANN.

[47]  Jeffrey Soar,et al.  Deep Learning Model for Detection of Pain Intensity from Facial Expression , 2018, International Conference on Smart Homes and Health Telematics.

[48]  Jeffrey F. Cohn,et al.  Painful data: The UNBC-McMaster shoulder pain expression archive database , 2011, Face and Gesture 2011.

[49]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[50]  Abdenour Hadid,et al.  On pain assessment from facial videos using spatio-temporal local descriptors , 2016, 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA).

[51]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[52]  Patrick Thiam,et al.  Multi-modal data fusion for pain intensity assessment and classification , 2017, 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA).

[53]  S. Rezaei,et al.  Pain Recognition Using Artificial Neural Network , 2006, 2006 IEEE International Symposium on Signal Processing and Information Technology.

[54]  Matthew S. Goodwin,et al.  Automated pain detection in facial videos of children using human-assisted transfer learning , 2018, AIH@IJCAI.

[55]  Peter Robinson,et al.  OpenFace: An open source facial behavior analysis toolkit , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[56]  Ayoub Al-Hamadi,et al.  Automatic Pain Assessment with Facial Activity Descriptors , 2017, IEEE Transactions on Affective Computing.

[57]  Daniel McDuff,et al.  Facial Action Unit Detection Using Active Learning and an Efficient Non-linear Kernel Approximation , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[58]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[59]  Ville Ojansivu,et al.  Blur Insensitive Texture Classification Using Local Phase Quantization , 2008, ICISP.

[60]  Patrick Thiam,et al.  Adaptive confidence learning for the personalization of pain intensity estimation systems , 2017, Evol. Syst..

[61]  Xiaolin Hu,et al.  Recurrent convolutional neural network for object recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[62]  Rosalind W. Picard,et al.  Continuous Pain Intensity Estimation from Autonomic Signals with Recurrent Neural Networks , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[63]  Michel F. Valstar,et al.  Local Gabor Binary Patterns from Three Orthogonal Planes for Automatic Facial Expression Recognition , 2013, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction.

[64]  Md. Atiqur Rahman Ahad,et al.  Motion history image: its variants and applications , 2012, Machine Vision and Applications.

[65]  Guoying Zhao,et al.  Recurrent Convolutional Neural Network Regression for Continuous Pain Intensity Estimation in Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[66]  Ayoub Al-Hamadi,et al.  Analysis of facial expressiveness during experimentally induced heat pain , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW).

[67]  Patrick Thiam,et al.  Deep Domain Adaptation for Facial Expression Analysis , 2019, 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW).

[68]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[69]  Ayoub Al-Hamadi,et al.  Automatic Pain Recognition from Video and Biomedical Signals , 2014, 2014 22nd International Conference on Pattern Recognition.

[70]  Chi-Chun Lee,et al.  Toward Development and Evaluation of Pain Level-Rating Scale for Emergency Triage based on Vocal Characteristics and Facial Expressions , 2016, INTERSPEECH.