Support tensor action spotting

In this paper we address the action spotting problem, that is the spatiotemporal detection and localization of an action. We first calculate a novel objective function between an input video sequence and an action's weights tensor, as acquired from a Support Tensor Machine classifier. We subsequently search for an appropriate transformation that maximizes the objective function, calculated as the multiplication of the original input tensor with the weights tensor. The proposed algorithm is very fast, as the above mentioned multiplication involves a set of a separable filters applied along each mode. We demonstrate the effectiveness of our method with experiments in a publicly available database where we show that our method outperforms existing techniques in terms of spatiotemporal action localization.

[1]  Maja Pantic,et al.  Spatiotemporal Localization and Categorization of Human Actions in Unsegmented Image Sequences , 2011, IEEE Transactions on Image Processing.

[2]  Jun Cheng,et al.  Vision-based hand gesture spotting and recognition , 2010, 2010 International Conference on Information, Networking and Automation (ICINA).

[3]  Stan Sclaroff,et al.  A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Ayoub Al-Hamadi,et al.  Hand trajectory-based gesture spotting and recognition using HMM , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[5]  Simon Baker,et al.  Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[6]  Gang Qian,et al.  Online Gesture Spotting from Visual Hull Data , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Michal Irani,et al.  Aligning Sequences and Actions by Maximizing Space-Time Correlations , 2006, ECCV.

[8]  Irene Kotsia,et al.  Relative Margin Support Tensor Machines for gait and action recognition , 2010, CIVR '10.

[9]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[10]  Richard P. Wildes,et al.  Efficient action spotting based on a spacetime oriented structure representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.