Pose-Invariant Face Detection by Replacing Deep Neurons with Capsules for Thermal Imagery in Telemedicine

The aim of this work was to examine the potential of thermal imaging as a cost-effective tool for convenient, nonintrusive remote monitoring of elderly people in different possible head orientations, without imposing specific behavior on users, e.g., looking toward the camera. Illumination and pose invariant head tracking is important for many medical applications as it can provide information, e.g., about vital signs, sensory experiences, injuries, wellbeing. In the performed experiments, we investigated the influence of different modifications of images (rotation, displacement of facial features, and displacement of facial quarters) on the prediction accuracy. Specifically, two models were tested on the set of collected low-resolution thermal images: Inception V3 Convolutional Neural Network (CNN) and Hinton’s Capsule Network. The preliminary results confirm that the prediction ability of the model based on capsules can deal with different head orientations much better than CNN (for the 45° head rotation Capsule Network achieved ~% accuracy while CNN only 9.5%).

[1]  Su Jin Lee,et al.  KoMIPS: a Web-based medical image processing system for telemedicine applications , 2002, 2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering. TENCOM '02. Proceedings..

[2]  Roderick McCall,et al.  D2.1 The Specification and Overall Requirements of theeGlasses Platform , 2014 .

[3]  Peter M. Prendergast,et al.  Anatomy of the Face and Neck , 2013 .

[4]  Li-Jia Li,et al.  Multi-view Face Detection Using Deep Convolutional Neural Networks , 2015, ICMR.

[5]  Tomasz Kocejko,et al.  Self diagnostics using smart glasses - preliminary study , 2016, 2016 9th International Conference on Human System Interactions (HSI).

[6]  Sachiko Kamiyama The Super Aged Society : Japan's the KAITEKI Institute studies how to keep the elderly healthy and active , 2014, IEEE Pulse.

[7]  H. Farhadi,et al.  The Effectiveness of Infrared Thermography in Patients with Whiplash Injury , 2015, Journal of Korean Neurosurgical Society.

[8]  Denis Laurendeau,et al.  Highly Accurate and Fully Automatic Head Pose Estimation from a Low Quality Consumer-Level RGB-D Sensor , 2015, HCMC '15.

[9]  B. Venkatraman,et al.  Potential of thermal imaging as a tool for prediction of cardiovascular disease , 2014, Journal of medical physics.

[10]  Agostino Villa,et al.  Performance evaluation of local healthcare systems by applying industrial management methods , 2010, 2010 IEEE Workshop on Health Care Management (WHCM).

[11]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[13]  汤晓鸥 Deep Convolutional Network Cascade for Facial Point Detection , 2013 .

[14]  Gang Hua,et al.  A convolutional neural network cascade for face detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Yang Jin,et al.  Capsule Network Performance on Complex Data , 2017, ArXiv.

[16]  Yau Hee Kho,et al.  Assistive healthcare home monitoring system for elderly people , 2016, 2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT).

[17]  Tomasz Kocejko,et al.  Proceedings of the Federated Conference on Computer Science and Information Systems pp. 405–410 ISBN 978-83-60810-22-4 Measuring Pulse Rate with a Webcam – a Non-contact Method for Evaluating Cardiac Activity , 2022 .

[18]  Mahnaz Etehadtavakol,et al.  Potential of Thermography in Pain Diagnosing and Treatment Monitoring , 2017 .

[19]  Bo Dong,et al.  Swallow monitoring through apnea detection in breathing signal , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[20]  Krzysztof Czuszynski,et al.  Real-Time Facial Features Detection from Low Resolution Thermal Images with Deep Classification Models , 2018 .

[21]  Luc Van Gool,et al.  Real-time face pose estimation from single range images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Simon J. Julier,et al.  DeepBreath: Deep learning of breathing patterns for automatic stress recognition using low-cost thermal imaging in unconstrained settings , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[23]  Jacek Ruminski,et al.  Evaluation of Respiration Rate Using Thermal Imaging in Mobile Conditions , 2017 .

[24]  Junwei Han,et al.  Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[25]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Thomas B. Moeslund,et al.  Spatio-temporal Pain Recognition in CNN-Based Super-Resolved Facial Images , 2016, VAAM/FFER@ICPR.