Cross-View Action Recognition by Projection-Based Augmentation

Challenging issue in cross-view action recognition is the difference between training viewpoint and testing viewpoint. Existing research deals with this problem by transferring knowledge, i.e., finding a viewpoint independent latent space in which action descriptors from different viewpoints are directly comparable. In this paper, we propose a novel approach to tackle the problem based on exploiting the discrimination in action execution through various viewpoints. We take the advantages of depth data to augment viewpoints from an initial camera viewpoint. In our framework, the local motion features and dedicated classifiers are built from the augmented viewpoints. We conduct experiments on the benchmark dataset, the Northwestern-UCLA Multiview Action 3D N-UCLA3D dataset. The experimental results indicated that our proposed method leads to outperform the state-of-the-art on the benchmark. In addition, we show the important role of viewpoints to improve the performance of action recognition.

[1]  Yiannis Aloimonos,et al.  Animated Heads: From 3D Motion Fields to Action Descriptions , 2000, DEFORM/AVATARS.

[2]  Jingjing Zheng,et al.  Learning View-Invariant Sparse Representations for Cross-View Action Recognition , 2013, 2013 IEEE International Conference on Computer Vision.

[3]  Silvio Savarese,et al.  Cross-view action recognition via view knowledge transfer , 2011, CVPR 2011.

[4]  Rémi Ronfard,et al.  Free viewpoint action recognition using motion history volumes , 2006, Comput. Vis. Image Underst..

[5]  James J. Little,et al.  Unlabelled 3D Motion Examples Improve Cross-View Action Recognition , 2014, BMVC.

[6]  Sridha Sridharan,et al.  The Backfilled GEI - A Cross-Capture Modality Gait Feature for Frontal and Side-View Gait Recognition , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[7]  Martial Hebert,et al.  Trajectons: Action recognition through the motion analysis of tracked features , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[8]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.

[9]  Chunheng Wang,et al.  Cross-View Action Recognition via a Continuous Virtual Path , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Arif Mahmood,et al.  Histogram of Oriented Principal Components for Cross-View Action Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[12]  Yasushi Yagi,et al.  Depth-based Gait Feature Representation , 2013, IPSJ Trans. Comput. Vis. Appl..

[13]  Binlong Li,et al.  Cross-view activity recognition using Hankelets , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Andrew Zisserman,et al.  The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.

[15]  Rama Chellappa,et al.  Domain adaptation for object recognition: An unsupervised approach , 2011, 2011 International Conference on Computer Vision.

[16]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[17]  Rama Chellappa,et al.  Cross-View Action Recognition via a Transferable Dictionary Pair , 2012, BMVC.

[18]  Ying Wu,et al.  Cross-View Action Modeling, Learning, and Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Christopher Joseph Pal,et al.  Activity recognition using the velocity histories of tracked keypoints , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[20]  Ajmal S. Mian,et al.  Learning a non-linear knowledge transfer model for cross-view action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[22]  Cordelia Schmid,et al.  Dense Trajectories and Motion Boundary Descriptors for Action Recognition , 2013, International Journal of Computer Vision.

[23]  Ruonan Li,et al.  Discriminative virtual views for cross-view action recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.