Extraction of action patterns using local temporal self-similarities of skeletal body-joints

RGB-Depth data has greatly improved human pose estimation; however, an additional step is still necessary to interpret sequences of human poses as more informative actions. In this paper, we explore extracting action patterns using temporal self-similarity from time-sequential skeletons recovered from such data. For each body joint, action patterns are extracted locally along the temporal extent of a given video. The standard bag-of-words framework is then employed to assemble these local patterns for action modeling. Action recognition is performed using a Naive-Bayes-Nearest-Neighbor classifier that also takes into account the spatial independence of body joints. Experimental results on a benchmark dataset, the UCF Kinect dataset, suggest the effectiveness and promise of the proposed action patterns.
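
As a rough illustration of what a local temporal self-similarity pattern for a single joint might look like, the following minimal sketch slides a temporal window over one joint's 3D trajectory and, for each window, records the pairwise distances between the joint's positions at different frames. The Euclidean distance, the window length, the stride, and the function names `self_similarity_matrix` and `local_patterns` are all assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
import math


def self_similarity_matrix(positions):
    """Pairwise Euclidean distances between one joint's positions over time.

    Assumption: Euclidean distance is used as the (dis)similarity measure.
    """
    n = len(positions)
    return [[math.dist(positions[i], positions[j]) for j in range(n)]
            for i in range(n)]


def local_patterns(trajectory, window=4, stride=2):
    """Extract one descriptor per temporal window of a joint trajectory.

    Each descriptor is the flattened upper triangle of the window's
    self-similarity matrix (the matrix is symmetric with a zero diagonal).
    The window and stride values are hypothetical defaults.
    """
    patterns = []
    for start in range(0, len(trajectory) - window + 1, stride):
        ssm = self_similarity_matrix(trajectory[start:start + window])
        patterns.append([ssm[i][j]
                         for i in range(window)
                         for j in range(i + 1, window)])
    return patterns


# Toy trajectory: a joint moving along the x axis at one unit per frame.
trajectory = [(float(t), 0.0, 0.0) for t in range(6)]
patterns = local_patterns(trajectory, window=4, stride=2)
```

In this toy case both windows produce the same descriptor, because pairwise distances depend only on relative motion within the window and not on where the joint is in space. Descriptors of this kind could then be quantized into a codebook for a bag-of-words model, or compared directly by nearest-neighbor search.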
