Stacked sparse autoencoder and history of binary motion image for human activity recognition

The recognition of human actions in a video sequence remains a challenging task in the computer vision community. Several techniques have been proposed to date, such as silhouette detection, local space-time features, and optical flow. In this paper, supervised learning followed by unsupervised learning based on the auto-encoder principle is proposed to address the problem. We introduce a new foreground detection architecture that combines information extracted from the Gaussian mixture model (GMM) with the uniform motion of the Magnitude of Optical Flow (MOF). We also use a fast dynamic frame skipping technique to avoid frames that contain irrelevant motion, which reduces the computational complexity of silhouette extraction. Furthermore, we present a new representation for human action recognition that builds an informative description of an action from the superposition of human silhouettes. We call this approach the history of binary motion image (HBMI). Our method has been evaluated by classification on the IXMAS, Weizmann, and KTH datasets. The Stacked Sparse Auto-encoder (SSAE), an instance of a deep learning strategy, is used for efficient human activity detection, and the Softmax classifier (SMC) for classification. The objective of this deep learning classifier is to learn feature hierarchies in which higher-level features are built from lower-level ones, providing an agile, robust, and simple method. The results demonstrate the effectiveness of the proposed approach under irregularities in how an action is performed, shape distortion, changes of viewpoint, and significant changes of scale.
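To make the idea of superposing silhouettes after frame skipping more concrete, the following is a minimal sketch under stated assumptions, not the authors' HBMI definition, which the abstract does not give. It assumes binary silhouettes are already available (for example, from a GMM-based foreground detector), skips frames whose silhouette barely changes, and superposes the remaining ones with a linearly increasing weight so that recent poses appear brighter. The function names, the `min_changed_pixels` threshold, and the linear weighting are all hypothetical choices made for illustration.

```python
import numpy as np


def skip_static_frames(silhouettes, min_changed_pixels=1):
    """Keep a frame only if it differs enough from the last kept frame.

    Hypothetical stand-in for dynamic frame skipping: the change measure
    here is a plain pixel count, not the optical-flow criterion of the paper.
    """
    kept = []
    for frame in silhouettes:
        frame = np.asarray(frame, dtype=bool)
        changed = np.count_nonzero(frame ^ kept[-1]) if kept else None
        if changed is None or changed >= min_changed_pixels:
            kept.append(frame)
    return kept


def superpose_silhouettes(silhouettes):
    """Superpose binary silhouettes into a single grayscale image.

    Assumed weighting: the t-th of n frames is drawn with intensity t / n,
    and newer silhouettes overwrite older ones where they overlap.
    """
    frames = [np.asarray(f, dtype=bool) for f in silhouettes]
    if not frames:
        raise ValueError("need at least one silhouette")
    n = len(frames)
    image = np.zeros(frames[0].shape, dtype=np.float64)
    for t, frame in enumerate(frames, start=1):
        image[frame] = t / n
    return image


# Toy sequence: a one-pixel "silhouette" moving right, with one repeated frame.
sequence = [
    [[1, 0, 0, 0]],
    [[1, 0, 0, 0]],  # identical to the previous frame, so it is skipped
    [[0, 1, 0, 0]],
    [[0, 0, 1, 0]],
]
kept_frames = skip_static_frames(sequence)
motion_image = superpose_silhouettes(kept_frames)
```

The resulting image could then be flattened into a vector and passed to a feature learner such as a stacked sparse auto-encoder; that step is omitted here because the abstract does not specify the network configuration.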
