论文信息 - Joint Angles Similiarities and HOG 2 for Action Recognition

Joint Angles Similiarities and HOG 2 for Action Recognition

We propose a set of features derived from skeleton tracking of the human body and depth maps for the purpose of action recognition. The descriptors proposed are easy to implement, produce relatively small-sized feature sets, and the multi-class classification scheme is fast and suitable for real-time applications. We intuitively characterize actions using pairwise affinities between view-invariant joint angles features over the performance of an action. Additionally, a new descriptor for spatio-temporal feature extraction from color and depth images is introduced. This descriptor involves an application of a modified histogram of oriented gradients (HOG) algorithm. The application produces a feature set at every frame, and these features are collected into a 2D array which then the same algorithm is applied to again (the approach is termed HOG 2 ). Both feature sets are evaluated in a bag-of-words scheme using a linear SVM, showing state-of-the-art results on public datasets from different domains of human-computer interaction.

Eshed Ohn-Bar | M. Trivedi

[1] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2] Meinard Müller,et al. Motion templates for automatic classification and retrieval of motion capture data , 2006, SCA '06.

[3] Ramakant Nevatia,et al. Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost , 2006, ECCV.

[4] Cordelia Schmid,et al. Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Václav Hlavác,et al. Pose primitive based human action recognition in videos or still images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Cordelia Schmid,et al. A Spatio-Temporal Descriptor Based on 3D-Gradients , 2008, BMVC.

[7] François Brémond,et al. Tracking HoG Descriptors for Gesture Recognition , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[8] Mohan M. Trivedi,et al. Vision-Based Infotainment User Determination by Hand Recognition for Driver Assistance , 2010, IEEE Transactions on Intelligent Transportation Systems.

[9] Wanqing Li,et al. Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[10] Juan Carlos Niebles,et al. Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.

[11] Wei Liang,et al. Discriminative human action recognition in the learned hierarchical manifold space , 2010, Image Vis. Comput..

[12] J.K. Aggarwal,et al. Human activity analysis , 2011, ACM Comput. Surv..

[13] Dimitris Samaras,et al. Two-person interaction detection using body-pose features and multiple instance learning , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[14] Ying Wu,et al. Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15] Mohan M. Trivedi,et al. Human Pose Estimation and Activity Recognition From Multi-View Videos: Comparative Explorations of Recent Developments , 2012, IEEE Journal of Selected Topics in Signal Processing.

[16] Xiaodong Yang,et al. Recognizing actions using depth motion maps-based histograms of oriented gradients , 2012, ACM Multimedia.

[17] Mohan M. Trivedi,et al. Hand gesture-based visual user interface for infotainment , 2012, AutomotiveUI.

[18] Mohan M. Trivedi,et al. 3-D Posture and Gesture Recognition for Interactivity in Smart Spaces , 2012, IEEE Transactions on Industrial Informatics.

[19] Joseph J. LaViola,et al. Exploring the Trade-off Between Accuracy and Observational Latency in Action Recognition , 2013, International Journal of Computer Vision.

[20] Ying Wu,et al. Robust 3D Action Recognition with Random Occupancy Patterns , 2012, ECCV.

[21] Zicheng Liu,et al. HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22] Mohan M. Trivedi,et al. In-vehicle hand activity recognition using integration of regions , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).