View-robust action recognition based on temporal self-similarities and dynamic time warping

In this paper, we propose an approach for human action recognition based on self-similarities of actions and dynamic-time warping method. To recognize actions under arbitrary views, we use a recent self-similarity matrix (SSM) method. Through analyzing the essence of SSMs we find that the SSMs capture a wealth of global time information useful for action recognition robust to viewpoints. The dynamic-time warping (DTW) algorithm is applied to make full use of the time information contained in SSMs. After performing DTW, we compute a collection of distances corresponding to mapped set of descriptors between the test sequence and all training sequences. Then the k-nearest neighbor classifier (KNNC) is implemented to classify the test action. We validated our method on the public multi-view IXMAS dataset and obtained promising results compared to the state-of-the-art bag-offeature-based method.

[1]  V. Ramasubramanian,et al.  Towards fast, view-invariant human action recognition , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[2]  Honghai Liu,et al.  Advances in View-Invariant Human Motion Analysis: A Review , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[3]  Patrick Pérez,et al.  View-Independent Action Recognition from Temporal Self-Similarities , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Mubarak Shah,et al.  Learning 4D action feature models for arbitrary view action recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Patrick Pérez,et al.  Cross-View Action Recognition from Temporal Self-similarities , 2008, ECCV.

[6]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[7]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2006, BMVC.

[8]  Samsu Sempena,et al.  Human action recognition using Dynamic Time Warping , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[9]  Yang Yang,et al.  View-invariant action recognition based on local linear dynamical system , 2010, International Congress on Image and Signal Processing.

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Ali Farhadi,et al.  Learning to Recognize Activities from the Wrong View Point , 2008, ECCV.

[12]  Rémi Ronfard,et al.  Action Recognition from Arbitrary Views using 3D Exemplars , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[13]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[14]  Ali Farhadi,et al.  A latent model of discriminative aspect , 2009, 2009 IEEE 12th International Conference on Computer Vision.