Semi-Supervised Learning in Random Forest Classifier for Human Action Recognition

The objective of human action recognition is the interpretation of ongoing events and context from video data for automated systems. In this paper, motion history image (MHI) is used as the region of interest (ROI) of action during the training phase to recognize human actions effectively. Therefore, the extracted spatio-temporal interest points (STIPs), that are used to train the classifier model, are free from noisy interest points due to the clutter background and illumination changes. After extracting the STIPs, the histogram of oriented gradient (HOG) and histogram of optical flow (HOF) features are calculated for the video patches extracted around the STIPs. Action recognition is performed by calculating mutual information of each STIP with respect to all the action classes provided in the training dataset. Mutual information is calculated by using random forest voting. The trees of random forest are built through proposed semi-supervised learning. The tree nodes are split by using unsupervised learning upto a certain predefined depth of the tree by taking the maximum variance of feature differences of the hypothesis. Next, the splitting process of the nodes is carried out by binary error minimization technique based on supervised learning. The experiments are performed on the standard KTH dataset. The performance of proposed technique is 95% which is better compared to earlier reported methods. Further, similar action classes from Weizmann dataset are tested on the same KTH trained forest model and the results are relevantly comparable with the state-of-the-art methods.

[1]  Jifeng Sun,et al.  Human action recognition based on feature level fusion and random projection , 2016, 2016 5th International Conference on Computer Science and Network Technology (ICCSNT).

[2]  Ying Wu,et al.  Discriminative subvolume search for efficient action detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Muhammad Haroon Yousaf,et al.  Multi-view Human Action Recognition Using Histograms of Oriented Gradients (HOG) Description of Motion History Images (MHIs) , 2015, 2015 13th International Conference on Frontiers of Information Technology (FIT).

[4]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.

[5]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Chin-Pan Huang,et al.  Human Action Recognition Using Histogram of Oriented Gradient of Motion History Image , 2011, 2011 First International Conference on Instrumentation, Measurement, Computer, Communication and Control.

[7]  Gang Yu,et al.  Unsupervised random forest indexing for fast action search , 2011, CVPR 2011.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  H. Ebrahimpour-Komleh,et al.  Shape-based human action recognition using multi-input topology of deep belief networks , 2017, 2017 9th International Conference on Information and Knowledge Technology (IKT).

[10]  Andrea Cavallaro,et al.  Video-Based Human Behavior Understanding: A Survey , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  B. Jagadeesh,et al.  Video based action detection and recognition human using optical flow and SVM classifier , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[12]  Mubarak Shah,et al.  A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.

[13]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[14]  Gang Yu,et al.  Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search , 2011, IEEE Transactions on Multimedia.

[15]  Dmitry B. Goldgof,et al.  Understanding Transit Scenes: A Survey on Human Behavior-Recognition Algorithms , 2010, IEEE Transactions on Intelligent Transportation Systems.

[16]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[17]  Md. Atiqur Rahman Ahad,et al.  Motion history image: its variants and applications , 2012, Machine Vision and Applications.

[18]  Antonio Criminisi,et al.  Object Class Segmentation using Random Forests , 2008, BMVC.

[19]  Supavadee Aramvith,et al.  Human action recognition using direction histograms of optical flow , 2011, 2011 11th International Symposium on Communications & Information Technologies (ISCIT).