Spatio-temporal Pain Recognition in CNN-Based Super-Resolved Facial Images

Automatic pain detection is a long expected solution to a prevalent medical problem of pain management. This is more relevant when the subject of pain is young children or patients with limited ability to communicate about their pain experience. Computer vision-based analysis of facial pain expression provides a way of efficient pain detection. When deep machine learning methods came into the scene, automatic pain detection exhibited even better performance. In this paper, we figured out three important factors to exploit in automatic pain detection: spatial information available regarding to pain in each of the facial video frames, temporal axis information regarding to pain expression pattern in a subject video sequence, and variation of face resolution. We employed a combination of convolutional neural network and recurrent neural network to setup a deep hybrid pain detection framework that is able to exploit both spatial and temporal pain information from facial video. In order to analyze the effect of different facial resolutions, we introduce a super-resolution algorithm to generate facial video frames with different resolution setups. We investigated the performance on the publicly available UNBC-McMaster Shoulder Pain database. As a contribution, the paper provides novel and important information regarding to the performance of a hybrid deep learning framework for pain detection in facial images of different resolution.

[1]  Thomas S. Huang,et al.  Image Super-Resolution: Historical Overview and Future Challenges , 2017 .

[2]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[3]  Sethuraman Panchanathan,et al.  Multimodal emotion recognition using deep learning architectures , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[4]  Matthew S. Goodwin,et al.  Automated Assessment of Children’s Postoperative Pain Using Computer Vision , 2015, Pediatrics.

[5]  R. Redi [FACIAL EXPRESSION IN PAIN]. , 1965, Rassegna clinico-scientifica.

[6]  Christopher Joseph Pal,et al.  EmoNets: Multimodal deep learning approaches for emotion recognition in video , 2015, Journal on Multimodal User Interfaces.

[7]  Laura J. Hoeksema,et al.  Caring for Patients With Chronic Pain: Pearls and Pitfalls , 2013, The Journal of the American Osteopathic Association.

[8]  Andrew Zisserman,et al.  Super-resolution enhancement of text image sequences , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[9]  K. Prkachin,et al.  The structure, reliability and validity of pain expression: Evidence from patients with shoulder pain , 2008, PAIN.

[10]  Thomas B. Moeslund,et al.  Pain recognition using spatiotemporal oriented energy of facial muscles , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[11]  Kwang In Kim,et al.  Example-based learning for image super-resolution , 2004 .

[12]  Cha Zhang,et al.  Image based Static Facial Expression Recognition with Multiple Deep Network Learning , 2015, ICMI.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  K. Craig,et al.  Measuring movement-exacerbated pain in cognitively impaired frail elders. , 2000, The Clinical journal of pain.

[15]  V.R.S Mani,et al.  Survey of Medical Image Registration , 2013 .

[16]  Feng Li,et al.  Universal HMT based super resolution for remote sensing images , 2008, 2008 15th IEEE International Conference on Image Processing.

[17]  Arindam Sikdar,et al.  Computer-Vision-Guided Human Pulse Rate Estimation: A Review , 2016, IEEE Reviews in Biomedical Engineering.

[18]  Thomas B. Moeslund,et al.  Aalborg Universitet Pain Expression as a Biometric Why Patients’ Self-Reported Pain doesn’t Match with the Objectively Measured Pain? , 2016 .

[19]  Thomas B. Moeslund,et al.  Quality-Aware Estimation of Facial Landmarks in Video Sequences , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[20]  Thomas S. Huang,et al.  How deep neural networks can improve emotion recognition on video data , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[21]  K. Craig,et al.  Genuine, suppressed and faked facial behavior during exacerbation of chronic low back pain , 1991, Pain.

[22]  Guoying Zhao,et al.  Recurrent Convolutional Neural Network Regression for Continuous Pain Intensity Estimation in Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[23]  K. Scherer,et al.  FACIAL EXPRESSION , 2017 .

[24]  K. Prkachin The consistency of facial expressions of pain: a comparison across modalities , 1992, Pain.

[25]  Miriam Kunz,et al.  Sex differences in facial encoding of pain. , 2006, The journal of pain : official journal of the American Pain Society.

[26]  Yang Wang,et al.  rnn : Recurrent Library for Torch , 2015, ArXiv.

[27]  V. Chandran,et al.  Investigation into Optical Flow Super-Resolution for Surveillance Applications , 2005 .

[28]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[29]  Soo-Young Lee,et al.  Hierarchical committee of deep convolutional neural networks for robust facial expression recognition , 2016, Journal on Multimodal User Interfaces.

[30]  Shiguang Shan,et al.  Face recognition on large-scale video in the wild with hybrid Euclidean-and-Riemannian metric learning , 2015, Pattern Recognit..

[31]  Bruce H. Pillman Super-Resolution Imaging , 2013, J. Electronic Imaging.

[32]  Yucel Altunbasak,et al.  Face recognition with independent component-based super-resolution , 2006, Electronic Imaging.

[33]  Kwang In Kim,et al.  Example-Based Learning for Single-Image Super-Resolution , 2008, DAGM-Symposium.

[34]  P. Ekman,et al.  Facial action coding system: a technique for the measurement of facial movement , 1978 .

[35]  Dongqing Zhang,et al.  Neural Aggregation Network for Video Face Recognition , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Takeshi Norimatsu,et al.  Encoding and Decoding , 2016 .

[37]  K. Prkachin,et al.  Assessing pain behaviour of low-back pain patients in real time: concurrent validity and examiner sensitivity. , 2002, Behaviour research and therapy.

[38]  Ryszard Stasinski,et al.  Toeplitz-based iterative image fusion scheme for MRI , 2008, 2008 15th IEEE International Conference on Image Processing.

[39]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[40]  Miriam Kunz,et al.  Decoding pain from the facial display of patients with dementia: a comparison of professional and nonprofessional observers. , 2013, Pain medicine.

[41]  Miriam Kunz,et al.  Smiling in Pain: Explorations of Its Social Motives , 2013, Pain research and treatment.

[42]  Miriam Kunz,et al.  On the relationship between self-report and facial expression of pain. , 2004, The journal of pain : official journal of the American Pain Society.

[43]  Thomas B. Moeslund,et al.  Super-resolution: a comprehensive survey , 2014, Machine Vision and Applications.

[44]  Sridha Sridharan,et al.  Automatically Detecting Pain in Video Through Facial Action Units , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[45]  B.V.K.V. Kumar,et al.  Recognition of Low-Resolution Faces Using Multiple Still Images and Multiple Cameras , 2008, 2008 IEEE Second International Conference on Biometrics: Theory, Applications and Systems.

[46]  William T. Freeman,et al.  Example-Based Super-Resolution , 2002, IEEE Computer Graphics and Applications.

[47]  A H Vallerand,et al.  The relationship of gender to pain. , 2000, Pain management nursing : official journal of the American Society of Pain Management Nurses.

[48]  Thomas B. Moeslund,et al.  Constructing facial expression log from video sequences using face quality assessment , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[49]  K. Prkachin Facial pain expression. , 2011, Pain management.

[50]  Thomas B. Moeslund,et al.  Real-time acquisition of high quality face sequences from an active pan-tilt-zoom camera , 2013, 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[51]  Vittorio Murino,et al.  Distilling information with super-resolution for video surveillance , 2004, VSSN '04.

[52]  Yücel Altunbasak,et al.  Eigenface-domain super-resolution for face recognition , 2003, IEEE Trans. Image Process..

[53]  Gang Hua,et al.  Hierarchical-PEP model for real-world face recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Jeffrey F. Cohn,et al.  Painful data: The UNBC-McMaster shoulder pain expression archive database , 2011, Face and Gesture 2011.

[55]  Haim Azhari,et al.  Super-resolution in PET imaging , 2006, IEEE Transactions on Medical Imaging.

[56]  K. Prkachin,et al.  Encoding and decoding of pain expressions: a judgement study , 1994, Pain.

[57]  S Peled,et al.  Superresolution in MRI: Application to human white matter fiber tract visualization by diffusion tensor imaging , 2001, Magnetic resonance in medicine.

[58]  Sergio Escalera,et al.  Spatiotemporal analysis of RGB-D-T facial images for multimodal pain level recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[59]  M. Kunz,et al.  The facial expression of pain in patients with dementia , 2007, Pain.