Human actions recognition using bag of optical flow words

In this paper, we present an improved approach to recognize human action based on the BOW model and the pLSA model. We propose an improved feature with optical flow to build our bag of words. This feature is able to reduce the high dimension of the pure optical flow template and also has abundant motion information. Then, we use the topic model of pLSA (probabilistic Latent Semantic Analysis) to classify human actions in a special way. We find that the existing methods lead to some mismatching of words due to the k-means clustering approach. To reduce the probability of mismatching, we add the spatial information to each word and improve the training and testing approach. Our approach of recognition is tested on two datasets, the KTH datasets and WEIZMANN datasets. The result shows its good performance.

[1]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[3]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[4]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[6]  Yang Wang,et al.  Human Action Recognition by Semilatent Topic Models , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[9]  Alberto Del Bimbo,et al.  Recognizing human actions by fusing spatio-temporal appearance and motion descriptors , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[10]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories , 2006 .

[11]  Pinar Duygulu Sahin,et al.  Human action recognition with line and flow histograms , 2008, 2008 19th International Conference on Pattern Recognition.

[12]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[13]  Zhenjiang Miao,et al.  Real Time Human Action Recognition in a Long Video Sequence , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.