论文信息 - View-robust action recognition based on temporal self-similarities and dynamic time warping

View-robust action recognition based on temporal self-similarities and dynamic time warping

In this paper, we propose an approach for human action recognition based on self-similarities of actions and dynamic-time warping method. To recognize actions under arbitrary views, we use a recent self-similarity matrix (SSM) method. Through analyzing the essence of SSMs we find that the SSMs capture a wealth of global time information useful for action recognition robust to viewpoints. The dynamic-time warping (DTW) algorithm is applied to make full use of the time information contained in SSMs. After performing DTW, we compute a collection of distances corresponding to mapped set of descriptors between the test sequence and all training sequences. Then the k-nearest neighbor classifier (KNNC) is implemented to classify the test action. We validated our method on the public multi-view IXMAS dataset and obtained promising results compared to the state-of-the-art bag-offeature-based method.

Jing Wang | Huicheng Zheng | Huicheng Zheng | Jing Wang

[1] V. Ramasubramanian,et al. Towards fast, view-invariant human action recognition , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[2] Honghai Liu,et al. Advances in View-Invariant Human Motion Analysis: A Review , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[3] Patrick Pérez,et al. View-Independent Action Recognition from Temporal Self-Similarities , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Mubarak Shah,et al. Learning 4D action feature models for arbitrary view action recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Patrick Pérez,et al. Cross-View Action Recognition from Temporal Self-similarities , 2008, ECCV.

[6] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[7] Juan Carlos Niebles,et al. Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2006, BMVC.

[8] Samsu Sempena,et al. Human action recognition using Dynamic Time Warping , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[9] Yang Yang,et al. View-invariant action recognition based on local linear dynamical system , 2010, International Congress on Image and Signal Processing.

[10] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11] Ali Farhadi,et al. Learning to Recognize Activities from the Wrong View Point , 2008, ECCV.

[12] Rémi Ronfard,et al. Action Recognition from Arbitrary Views using 3D Exemplars , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[13] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[14] Ali Farhadi,et al. A latent model of discriminative aspect , 2009, 2009 IEEE 12th International Conference on Computer Vision.