CNN-SkelPose: a CNN-based skeleton estimation algorithm for clinical applications

Computer-vision-based patient activity monitoring systems can be attractive for various unobtrusive clinical applications. Such a monitoring system can be developed using movement information derived from a skeleton model of the current body pose, e.g. obtained using a depth camera. Earlier research using estimated skeleton models has focused mostly on gaming applications. In this paper, we propose CNN-SkelPose, a skeleton model estimation method for clinical applications. CNN-SkelPose uses a trained Convolutional Neural Network to extract both local and global information from the depth image. CNN-SkelPose outperforms the Skeltrack baseline in reliably estimating skeleton models in patient monitoring scenarios. Our results show the inadequacy of existing skeleton model estimation methods when applied to a clinical scenario and suggest CNN-SkelPose as an improvement for this application.
