Ensemble neural network approach detecting pain intensity from facial expressions

Abstract This paper reports on research to design an ensemble deep learning framework that integrates fine-tuned, three-stream hybrid deep neural network (i.e., Ensemble Deep Learning Model, EDLM), employing Convolutional Neural Network (CNN) to extract facial image features, detect and accurately classify the pain. To develop the approach, the VGGFace is fine-tuned and integrated with Principal Component Analysis and employed to extract features in images from the Multimodal Intensity Pain database at the early phase of the model fusion. Subsequently, a late fusion, three layers hybrid CNN and recurrent neural network algorithm is developed with their outputs merged to produce image-classified features to classify pain levels. The EDLM model is then benchmarked by means of a single-stream deep learning model including several competing models based on deep learning methods. The results obtained indicate that the proposed framework is able to outperform the competing methods, applied in a multi-level pain detection database to produce a feature classification accuracy that exceeds 89 %, with a receiver operating characteristic of 93 %. To evaluate the generalization of the proposed EDLM model, the UNBC-McMaster Shoulder Pain dataset is used as a test dataset for all of the modelling experiments, which reveals the efficacy of the proposed method for pain classification from facial images. The study concludes that the proposed EDLM model can accurately classify pain and generate multi-class pain levels for potential applications in the medical informatics area, and should therefore, be explored further in expert systems for detecting and classifying the pain intensity of patients, and automatically evaluating the patients’ pain level accurately.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Sridha Sridharan,et al.  Automatically Detecting Pain in Video Through Facial Action Units , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Dacheng Tao,et al.  Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Baojun Zhao,et al.  Face Recognition System Using SVM Classifier and Feature Extraction by PCA and LDA Combination , 2009, 2009 International Conference on Computational Intelligence and Software Engineering.

[5]  K. Prkachin,et al.  The structure, reliability and validity of pain expression: Evidence from patients with shoulder pain , 2008, PAIN.

[6]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[7]  Chongsheng Zhang,et al.  An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme , 2018, Knowl. Based Syst..

[8]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[9]  Rashid Ansari,et al.  Learning Pain from Action Unit Combinations: A Weakly Supervised Approach via Multiple Instance Learning , 2017, IEEE Transactions on Affective Computing.

[10]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[11]  Ji Ma,et al.  Dimension reduction of image deep feature using PCA , 2019, J. Vis. Commun. Image Represent..

[12]  Michel Valstar,et al.  Automatic Neonatal Pain Estimation: An Acute Pain in Neonates Database , 2019, 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII).

[13]  Tianzi Jiang,et al.  A combinational feature selection and ensemble neural network method for classification of gene expression data , 2004, BMC Bioinformatics.

[14]  Miriam Kunz,et al.  Sex differences in facial encoding of pain. , 2006, The journal of pain : official journal of the American Pain Society.

[15]  Caifeng Shan,et al.  Learning local binary patterns for gender classification on real-world face images , 2012, Pattern Recognit. Lett..

[16]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[17]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Rodrigo Minetto,et al.  Hydra: An Ensemble of Convolutional Neural Networks for Geospatial Land Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Tsuhan Chen,et al.  The painful face - Pain expression recognition using active appearance models , 2009, Image Vis. Comput..

[20]  Jeffrey Soar,et al.  Enhanced deep learning algorithm development to detect pain intensity from facial expression images , 2020, Expert Syst. Appl..

[21]  Chuang Zhang,et al.  Horizontal and Vertical Ensemble with Deep Representation for Classification , 2013, ArXiv.

[22]  Ke Chen,et al.  Identity-aware convolutional neural networks for facial expression recognition , 2010 .

[23]  Hamido Fujita,et al.  Multi-Imbalance: An open-source software for multi-class imbalance learning , 2019, Knowl. Based Syst..

[24]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[25]  Jürgen Schmidhuber,et al.  LSTM recurrent networks learn simple context-free and context-sensitive languages , 2001, IEEE Trans. Neural Networks.

[26]  Ognjen Rudovic,et al.  Personalized Automatic Estimation of Self-Reported Pain Intensity from Facial Expressions , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[27]  M F Sanner,et al.  Python: a programming language for software integration and development. , 1999, Journal of molecular graphics & modelling.

[28]  Nikhil Ketkar,et al.  Introduction to Keras , 2017 .

[29]  Xiangliang Zhang,et al.  An up-to-date comparison of state-of-the-art classification algorithms , 2017, Expert Syst. Appl..

[30]  Amanda J. C. Sharkey,et al.  On Combining Artificial Neural Nets , 1996, Connect. Sci..

[31]  Rosalind W. Picard,et al.  Automatic Recognition Methods Supporting Pain Assessment: A Survey , 2019, IEEE Transactions on Affective Computing.

[32]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[33]  Kamal Nasrollahi,et al.  Deep Pain: Exploiting Long Short-Term Memory Networks for Facial Expression Classification , 2017, IEEE Transactions on Cybernetics.

[34]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[35]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[36]  Sergio Escalera,et al.  Deep Multimodal Pain Recognition: A Database and Comparison of Spatio-Temporal Visual Modalities , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[37]  Jane You,et al.  Adaptive Deep Metric Learning for Identity-Aware Facial Expression Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[38]  Xiaogang Wang,et al.  DeepReID: Deep Filter Pairing Neural Network for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[40]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Thomas B. Moeslund,et al.  Facial Expression Recognition for Traumatic Brain Injured Patients , 2018, VISIGRAPP.

[42]  Thomas B. Moeslund,et al.  Distributed Computing and Monitoring Technologies for Older Patients , 2016, SpringerBriefs in Computer Science.

[43]  Jeffrey F. Cohn,et al.  Painful data: The UNBC-McMaster shoulder pain expression archive database , 2011, Face and Gesture 2011.

[44]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[45]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Noah A. Smith,et al.  Transition-Based Dependency Parsing with Stack Long Short-Term Memory , 2015, ACL.

[47]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[48]  Thomas B. Moeslund,et al.  Spatio-temporal Pain Recognition in CNN-Based Super-Resolved Facial Images , 2016, VAAM/FFER@ICPR.

[49]  Jeffrey Soar,et al.  A Joint Deep Neural Network Model for Pain Recognition from Face , 2019, 2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS).

[50]  Grigorios Tsoumakas,et al.  Random k -Labelsets: An Ensemble Method for Multilabel Classification , 2007, ECML.

[51]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[52]  Xiaogang Wang,et al.  Deeply learned face representations are sparse, selective, and robust , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Chongsheng Zhang,et al.  Feature selection and resampling in class imbalance learning: Which comes first? An empirical study in the biological domain , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[54]  Radhika C. Damale,et al.  Face Recognition Based Attendance System Using Machine Learning Algorithms , 2018, 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS).

[55]  Guoying Zhao,et al.  Recurrent Convolutional Neural Network Regression for Continuous Pain Intensity Estimation in Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[56]  Nico Schertler,et al.  Improving JPEG Compression with Regression Tree Fields , 2014 .

[57]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..