Human activity recognition using deep belief networks

Human activity recognition using new generation depth sensors are particularly important for application that require human activity recognition. In this paper, a deep learning based algorithm is developed human activity recognition using RGB-D video sequences. Based on the assumption that every human activity is composed of many smaller actions, a temporal structure is being learnt in order to improve the classification of human activities. Since our approach is an attempt to develop a deep learning structure to the problem, it can be considered as a deep structural arhitecture. A deep neural network is obtained manipulating the activitation functions which yield hidden variables at every hidden layer. Our approach outperforms the methods that are constructed upon engineered features, since it uses the skeleton coordinates extracted from depth images. Tested on a new dataset, it is observed that our appproach outputs better recognition rates compared to those of other state-of-art methods.

[1]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[2]  Wanqing Li,et al.  Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[3]  Bart Selman,et al.  Unstructured human activity detection from RGBD images , 2011, 2012 IEEE International Conference on Robotics and Automation.

[4]  Thomas Hofmann,et al.  Greedy Layer-Wise Training of Deep Networks , 2007 .

[5]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[6]  Zicheng Liu,et al.  HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Hong Wei,et al.  A survey of human motion analysis using depth imagery , 2013, Pattern Recognit. Lett..

[8]  Antoni B. Chan,et al.  Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Daniel P. Siewiorek,et al.  Activity recognition and monitoring using multiple sensors on different body positions , 2006, International Workshop on Wearable and Implantable Body Sensor Networks (BSN'06).

[10]  Cristian Sminchisescu,et al.  Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[13]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jake K. Aggarwal,et al.  View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Hulya Yalcin,et al.  Recognition and classification of human activity from RGB-D videos , 2015, 2015 23nd Signal Processing and Communications Applications Conference (SIU).