RGB-D Fall Detection via Deep Residual Convolutional LSTM Networks

The development of smart healthcare environments has witnessed impressive advancements exploiting the recent technological capabilities. Since falls are considered a major health concern especially among older adults, low-cost fall detection systems have become an indispensable component in these environments. This paper proposes an integrable, privacy preserving and efficient fall detection system from depth images acquired using a Kinect RGB-D sensor. The proposed system uses an end-to-end deep learning architecture composed of convolutional and recurrent neural networks to detect fall events. The deep convolutional network (ConvNet) analyses the human body and extracts visual features from input sequence frames. Fall events are detected via modeling complex temporal dependencies between subsequent frame features using Long-Shot-Term-Memory (LSTM) recurrent neural networks. Both models are combined and jointly trained in an end-to-end ConvLSTM architecture. This allows the model to learn visual representations and complex temporal dynamics of fall motions simultaneously. The proposed method has been validated on the public URFD fall detection dataset and compared with different approaches, including accelerometer based methods. We achieved a near unity sensitivity and specificity rates in detecting fall events.

[1]  Trevor Darrell,et al.  Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Nadia Magnenat-Thalmann,et al.  Fall Detection Based on Body Part Tracking Using a Depth Camera , 2015, IEEE Journal of Biomedical and Health Informatics.

[3]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Shuwan Xue,et al.  Portable Preimpact Fall Detector With Inertial Sensors , 2008, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[5]  Jean Meunier,et al.  3D head tracking for fall detection using a single calibrated camera , 2013, Image Vis. Comput..

[6]  Inmaculada Plaza,et al.  Challenges, issues and trends in fall detection systems , 2013, Biomedical engineering online.

[7]  Saeid Nahavandi,et al.  Cyclist detection in LIDAR scans using faster R-CNN and synthetic depth images , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[8]  Saeid Nahavandi,et al.  A Skeleton-Free Fall Detection System From Depth Images Using Random Decision Forest , 2018, IEEE Systems Journal.

[9]  Navdeep Jaitly,et al.  Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.

[10]  Saeid Nahavandi,et al.  Body joints regression using deep convolutional neural networks , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[11]  Saeid Nahavandi,et al.  RGB-D human posture analysis for ergonomie studies using deep convolutional neural network , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[12]  Michael Firman,et al.  RGBD Datasets: Past, Present and Future , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Marjorie Skubic,et al.  Fall Detection in Homes of Older Adults Using the Microsoft Kinect , 2015, IEEE Journal of Biomedical and Health Informatics.

[14]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[15]  Saeid Nahavandi,et al.  A kinect-based workplace postural analysis system using deep residual networks , 2017, 2017 IEEE International Systems Engineering Symposium (ISSE).

[16]  Jingjing Xiao,et al.  LGT/VOT tracking performance evaluation of depth images , 2014, 2014 9th International Conference on System of Systems Engineering (SOSE).

[17]  Eva Negri,et al.  Risk Factors for Falls in Community-dwelling Older People: A Systematic Review and Meta-analysis , 2010, Epidemiology.

[18]  Saeid Nahavandi,et al.  Semantic body parts segmentation for quadrupedal animals , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[19]  Saeid Nahavandi,et al.  Skin melanoma segmentation using recurrent and convolutional neural networks , 2017, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017).

[20]  Roland Siegwart,et al.  Kinect v2 for mobile robot navigation: Evaluation and modeling , 2015, 2015 International Conference on Advanced Robotics (ICAR).

[21]  Bogdan Kwolek,et al.  Improving fall detection by the use of depth sensor and accelerometer , 2015, Neurocomputing.

[22]  A. Bourke,et al.  Fall detection - Principles and Methods , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[23]  K. Samsudin,et al.  Evaluation of fall detection classification approaches , 2012, 2012 4th International Conference on Intelligent and Advanced Systems (ICIAS2012).

[24]  M. Skubic,et al.  Older adults' attitudes towards and perceptions of ‘smart home’ technologies: a pilot study , 2004, Medical informatics and the Internet in medicine.

[25]  M. Tinetti,et al.  Predictors and prognosis of inability to get up after falls among elderly persons. , 1993, JAMA.

[26]  Wolfram Burgard,et al.  Multimodal deep learning for robust RGB-D object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[27]  Yann LeCun,et al.  Indoor Semantic Segmentation using depth information , 2013, ICLR.

[28]  Saeid Nahavandi,et al.  Safety applications using Kinect technology , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Saeid Nahavandi,et al.  An adaptable system for RGB-D based human body detection and pose estimation: Incorporating attached props , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[31]  AGEinG And LifE CoursE , fAmiLy And Community HEALtH WHo Global report on falls Prevention in older Age , .

[32]  O. Celik,et al.  Systematic review of Kinect applications in elderly care and stroke rehabilitation , 2014, Journal of NeuroEngineering and Rehabilitation.

[33]  Joris De Schutter,et al.  An adaptable system for RGB-D based human body detection and pose estimation , 2014, J. Vis. Commun. Image Represent..

[34]  Jitendra Malik,et al.  Learning Rich Features from RGB-D Images for Object Detection and Segmentation , 2014, ECCV.

[35]  Andrew Blake,et al.  Efficient Human Pose Estimation from Single Depth Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Saeid Nahavandi,et al.  Intent prediction of vulnerable road users from motion trajectories using stacked LSTM network , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).