Multiple classifier-based spatiotemporal features for living activity prediction

ABSTRACT Nowadays, the action prediction technique plays an important role in many automatic systems. There are some proposed methods for this issue. However, they retain limitations such as accuracy and computational time, especially for applying in limited resource systems. This paper presents an approach to enhance the efficiency of the activity prediction task. The work processes on multiple classifiers using spatiotemporal features based on scalable feature descriptors, such as histogram of oriented gradients (HOG), histogram of oriented optical flow (HOF), and motion boundary histogram (MBH). In order to improve prediction accuracy, two layers of classified machine models are studied for applying on spatiotemporal features with the dynamic foreground extraction process. The first layer based on unsupervised classification is proposed to construct a dictionary of features, which supports for distinguishing and uniform the number of features. The next task is that supervised machine learning supports for final decision of action classes. The proposed approach was evaluated on several benchmark datasets, which are available online. The results demonstrate that the approach enhances accuracy and efficiency of the prediction system.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Ioannis Patras,et al.  Action recognition using saliency learned from recorded human gaze , 2016, Image Vis. Comput..

[3]  Jing Zhang,et al.  RGB-D-based action recognition datasets: A survey , 2016, Pattern Recognit..

[4]  Yang Yi,et al.  Human action recognition with graph-based multiple-instance learning , 2016, Pattern Recognit..

[5]  Yun Fu,et al.  Low-Rank Tensor Subspace Learning for RGB-D Action Recognition , 2016, IEEE Transactions on Image Processing.

[6]  Álvaro Herrero,et al.  Features and models for human activity recognition , 2015, Neurocomputing.

[7]  Rajiv Kapoor,et al.  Hybrid classifier based human activity recognition using the silhouette and cells , 2015, Expert Syst. Appl..

[8]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[9]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[10]  Wesley De Neve,et al.  Effective and efficient human action recognition using dynamic frame skipping and trajectory rejection , 2017, Image Vis. Comput..

[11]  Ruifeng Li,et al.  Gradient-layer feature transform for action detection and recognition , 2016, J. Vis. Commun. Image Represent..

[12]  Robert Bergevin,et al.  Semantic human activity recognition: A literature review , 2015, Pattern Recognit..

[13]  Van-Dung Hoang,et al.  Motion Segmentation Using Optical Flow for Pedestrian Detection from Moving Vehicle , 2014, ICCCI.

[14]  Youssef Iraqi,et al.  Weighted averaging fusion for multi-view skeletal data and its application in action recognition , 2016, IET Comput. Vis..

[15]  Fang Liu,et al.  Simple to Complex Transfer Learning for Action Recognition , 2016, IEEE Transactions on Image Processing.

[16]  Antonio Fernández-Caballero,et al.  A survey of video datasets for human action and activity recognition , 2013, Comput. Vis. Image Underst..

[17]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[18]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[19]  Feng Shi,et al.  Local part model for action recognition , 2016, Image Vis. Comput..

[20]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[21]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Cordelia Schmid,et al.  Actions in context , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Yun Fu,et al.  Discriminative Relational Representation Learning for RGB-D Action Recognition , 2016, IEEE Transactions on Image Processing.

[24]  Mubarak Shah,et al.  UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.