Depth video-based human activity recognition system using translation and scaling invariant features for life logging at smart home

Video-based human activity recognition systems have potential contributions to various applications such as smart homes and healthcare services. In this work, we present a novel depth video-based translation and scaling invariant human activity recognition (HAR) system utilizing R transformation of depth silhouettes. To perform HAR in indoor settings, an invariant HAR method is critical to freely perform activities anywhere in a camera view without translation and scaling problems of human body silhouettes. We obtain such invariant features via R transformation on depth silhouettes. Furthermore, in R transforming depth silhouettes, shape information of human body reflected in depth values is encoded into the features. In R transformation, 2D feature maps are computed first through Radon transform of each depth silhouette followed by computing 1D feature profile through R transform to get the translation and scaling invariant features. Then, we apply Principle Component Analysis (PCA) for dimension reduction and Linear Discriminant Analysis (LDA) to make the features more prominent, compact and robust. Finally, Hidden Markov Models (HMMs) are used to train and recognize different human activities. Our proposed system shows superior recognition rate over the conventional approaches, reaching up to the mean recognition rate of 93.16% for six typical human activities whereas the conventional PC and IC-based depth silhouettes achieved only 74.83% and 86.33% ,while binary silhouettes-based R transformation approach achieved 67.08% respectively. Our experimental results show that the proposed method is robust, reliable, and efficient in recognizing the daily human activities.

[1]  Rabab Kreidieh Ward,et al.  A New Orientation-Adaptive Interpolation Method , 2007, IEEE Transactions on Image Processing.

[2]  Liang Wang,et al.  Learning and Matching of Dynamic Shape Manifolds for Human Action Recognition , 2007, IEEE Transactions on Image Processing.

[3]  Chang-Su Kim,et al.  Frame loss concealment for stereoscopic video plus depth sequences , 2011, IEEE Transactions on Consumer Electronics.

[4]  M. Mandal,et al.  Pose recognition using the Radon transform , 2005, 48th Midwest Symposium on Circuits and Systems, 2005..

[5]  Bruce A. Draper,et al.  Recognizing faces with PCA and ICA , 2003, Comput. Vis. Image Underst..

[6]  Dan Schonfeld,et al.  Object Trajectory-Based Activity Classification and Recognition Using Hidden Markov Models , 2007, IEEE Transactions on Image Processing.

[7]  Eric Campo,et al.  A review of smart homes - Present state and future challenges , 2008, Comput. Methods Programs Biomed..

[8]  Lei Chen,et al.  Gender Recognition from Gait Using Radon Transform and Relevant Component Analysis , 2009, ICIC.

[9]  Francisco José Madrid-Cuevas,et al.  Depth silhouettes for gesture recognition , 2008, Pattern Recognit. Lett..

[10]  Md. Zia Uddin,et al.  Independent shape component-based human activity recognition via Hidden Markov Model , 2010, Applied Intelligence.

[11]  Jin-Hyung Kim,et al.  An HMM-Based Threshold Model Approach for Gesture Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Yo-Sung Ho,et al.  Three-dimensional natural video system based on layered representation of depth maps , 2006, IEEE Transactions on Consumer Electronics.

[13]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[14]  Sang Hwa Lee,et al.  Real-time disparity estimation algorithm for stereo camera systems , 2011, IEEE Transactions on Consumer Electronics.

[15]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[16]  Alaa Eleyan,et al.  PCA and LDA Based Face Recognition Using Feedforward Neural Network Classifier , 2006, MRCS.

[17]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[18]  Laurent Wendling,et al.  A new shape descriptor defined on the Radon transform , 2006, Comput. Vis. Image Underst..

[19]  Hao Zhang,et al.  Recognizing Human Activities by Key Frame in Video Sequences , 2010, J. Softw..

[20]  Rama Chellappa,et al.  Appearance Modeling Using a Geometric Transform , 2009, IEEE Transactions on Image Processing.

[21]  Rama Chellappa,et al.  Machine Recognition of Human Activities: A Survey , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  M. A. Fiddy,et al.  The Radon Transform and Some of Its Applications , 1985 .

[23]  Hisham Othman,et al.  A Separable Low Complexity 2D HMM with Application to Face Recognition , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Avinash C. Kak,et al.  PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Peter H. N. de With,et al.  Automatic video-based human motion analyzer for consumer surveillance system , 2009, IEEE Transactions on Consumer Electronics.

[26]  Václav Matousek,et al.  HMM based handwritten text recognition using biometrical data acquisition pen , 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).

[27]  Kwan H. Lee,et al.  Depth video based human model reconstruction resolving self-occlusion , 2010, IEEE Transactions on Consumer Electronics.

[28]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Junji Yamato,et al.  Recognizing human action in time-sequential images using hidden Markov model , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[30]  Xia Liu,et al.  Hand gesture recognition using depth data , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[31]  Nikolaos V. Boulgouris,et al.  Gait Recognition Using Radon Transform and Linear Discriminant Analysis , 2007, IEEE Transactions on Image Processing.

[32]  Ying Wang,et al.  Human Activity Recognition Based on R Transform , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Bir Bhanu,et al.  Human Activity Recognition in Thermal Infrared Imagery , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[34]  Jieping Ye,et al.  Generalized Linear Discriminant Analysis: A Unified Framework and Efficient Model Selection , 2008, IEEE Transactions on Neural Networks.