RVM-based human action classification through Gabor and Haar feature extraction

Human action recognition plays a vital role in surveillance applications. Human action recognition is motivated by some of the applications such as video retrieval, video surveillance systems, human robot interaction, to interact with deaf and dumb people etc. The aim is to analyse the role of Adaboost in the process of recognising the human action by extracting the motion features using optical flow. Adaboost is a supervised learning method used to select the subset of frames with most discriminatory motion features. Saliency point computation is performed to assign a measure of interest to each visual unit. Mean shift algorithm is then used for tracking the objects. Gabor feature is the global feature that includes more detailed information of frequency and orientation. Haar feature is used to show the variation in the pixel. Relevance vector machine classification gives a probabilistic output through Bayesian inference. The proposed system reduces the computation time and provides a higher recognition rate in comparison with existing gentle boost-based recognition system.

[1]  U. N. Sinha Silhouette Based Human Motion Detection and Recognising their Actions from the Captured Video Streams , 2011 .

[2]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[3]  Maja Pantic,et al.  Spatiotemporal Localization and Categorization of Human Actions in Unsegmented Image Sequences , 2011, IEEE Transactions on Image Processing.

[4]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevance Vector Machine , 2001 .

[6]  Joni-Kristian Kämäräinen,et al.  Invariance properties of Gabor filter-based features-overview and applications , 2006, IEEE Transactions on Image Processing.

[7]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[8]  Alex Pentland,et al.  Human computing and machine understanding of human behavior: a survey , 2006, ICMI '06.

[9]  Ling Shao,et al.  Boosted key-frame selection and correlated pyramidal motion-feature representation for human action recognition , 2013, Pattern Recognit..

[10]  I. Patras,et al.  Spatiotemporal salient points for visual recognition of human actions , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Masaaki Shibata,et al.  Moving Object Detection for Active Camera based on Optical Flow Distortion , 2008 .

[12]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[13]  S. Raju,et al.  RVM-Based Human Action Classification in Crowd through Projection and Star Skeletonization , 2009, EURASIP J. Image Video Process..

[14]  Chung-Lin Huang,et al.  Hybrid-boost learning for multi-pose face detection and facial expression recognition , 2008, Pattern Recognit..

[15]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Dennis Gabor,et al.  Theory of communication , 1946 .

[17]  Pinar Duygulu Sahin,et al.  Human Action Recognition Using Distribution of Oriented Rectangular Patches , 2007, Workshop on Human Motion.

[18]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[19]  Yannis Avrithis,et al.  Spatiotemporal Features for Action Recognition and Salient Event Detection , 2011, Cognitive Computation.

[20]  Jintao Li,et al.  Hierarchical spatio-temporal context modeling for action recognition , 2009, CVPR.

[21]  Xuelong Li,et al.  General Tensor Discriminant Analysis and Gabor Features for Gait Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Maja Pantic,et al.  Sparse B-spline polynomial descriptors for human activity recognition , 2009, Image Vis. Comput..

[23]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[24]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2006, BMVC.

[25]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.