Histogram of Body Poses and Spectral Regression Discriminant Analysis for Human Action Categorization

This paper explores a recently proposed but rarely reported subspace learning method, Spectral Regression Discriminant Analysis (SRDA) [1, 2], for silhouette-based human action recognition. The recognition algorithm adopts the Bag of Words (BoW) model and represents each action as a Histogram of Body Poses sampled from the silhouettes in the video sequence. In addition, we compare SRDA as a dimensionality reduction step against several traditional subspace learning methods: Principal Component Analysis (PCA), supervised Locality Preserving Projections (LPP), unsupervised LPP and Neighbourhood Preserving Embedding (NPE). Experimental results show that the Histogram of Body Poses combined with SRDA or its kernel version, SRKDA, achieves 100% recognition accuracy on the Weizmann human action dataset, which is better than any previously published result on the same dataset.
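
The pipeline summarised above (pose codebook, per-video pose histogram, SRDA projection) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the per-frame pose descriptor is taken to be an arbitrary fixed-length vector, the codebook is built with plain k-means, SRDA is written in its commonly described form as ridge regression onto centred class-indicator responses, and the nearest-centroid classifier is a stand-in. All function names and the `alpha` parameter are hypothetical.

```python
import numpy as np

def build_codebook(descriptors, k, iters=20, seed=0):
    """Assumed codebook step: plain k-means over per-frame pose descriptors."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(descriptors), size=k, replace=False)
    centers = descriptors[idx].copy()
    for _ in range(iters):
        d = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members):  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def pose_histogram(frames, codebook):
    """Assign each frame to its nearest codeword; return a normalised histogram."""
    d = ((frames[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    counts = np.bincount(d.argmin(axis=1), minlength=len(codebook))
    return counts / counts.sum()

def srda_fit(X, y, alpha=0.01):
    """SRDA sketch: regress centred data onto c-1 orthogonal class responses."""
    classes = np.unique(y)
    n = len(y)
    # Orthogonalise the class-indicator vectors against the all-ones vector;
    # columns 1..c-1 of Q are the c-1 usable responses.
    cols = [np.ones(n)] + [(y == c).astype(float) for c in classes]
    Q, _ = np.linalg.qr(np.column_stack(cols))
    Y = Q[:, 1:len(classes)]
    mean = X.mean(axis=0)
    Xc = X - mean
    # Regularised least squares: (Xc^T Xc + alpha I) A = Xc^T Y
    A = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ Y)
    return mean, A

def srda_project(X, mean, A):
    """Map histograms into the (c-1)-dimensional discriminant subspace."""
    return (X - mean) @ A

def nearest_centroid_predict(Z_train, y_train, Z_test):
    """Assumed classifier: label each sample by the closest class centroid."""
    classes = np.unique(y_train)
    centroids = np.stack([Z_train[y_train == c].mean(axis=0) for c in classes])
    d = ((Z_test[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return classes[d.argmin(axis=1)]
```

In use, each video would contribute one row of `X` via `pose_histogram`, with `y` holding the action labels; `srda_fit` then yields a projection to at most c-1 dimensions for c action classes, in which any simple classifier can be applied.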

[1]  Ling Shao,et al.  Eigen-space learning using semi-supervised diffusion maps for human action recognition , 2010, CIVR '10.

[2]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[3]  Jiawei Han,et al.  SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.

[4]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[5]  D. Ruppert.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004.

[6]  Jake K. Aggarwal,et al.  Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[7]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[8]  Jiawei Han,et al.  Regularized locality preserving indexing via spectral regression , 2007, CIKM '07.

[9]  Dit-Yan Yeung,et al.  Human action recognition using Local Spatio-Temporal Discriminant Embedding , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[11]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[13]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[14]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[15]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[16]  Jiawei Han,et al.  Spectral Regression for Efficient Regularized Subspace Learning , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[17]  B. Schölkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop.

[18]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[20]  Gary L. Miller,et al.  Graph Embeddings and Laplacian Eigenvalues , 2000, SIAM J. Matrix Anal. Appl.

[21]  Matti Pietikäinen,et al.  Texture Based Description of Movements for Activity Analysis , 2008, VISAPP.

[22]  David G. Stork,et al.  Pattern Classification , 1973 .

[23]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[24]  Liang Wang,et al.  Recognizing Human Activities from Silhouettes: Motion Subspace and Factorial Discriminative Graphical Model , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[26]  Ling Shao,et al.  Feature detector and descriptor evaluation in human action recognition , 2010, CIVR '10.

[27]  Cordelia Schmid,et al.  Actions in context , 2009, CVPR.

[28]  Liang Wang,et al.  Learning and Matching of Dynamic Shape Manifolds for Human Action Recognition , 2007, IEEE Transactions on Image Processing.

[29]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[30]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.