An Effective 3D Geometric Relational Feature Descriptor for Human Action Recognition

This paper presents an effective feature descriptor for recognizing human actions from three-dimension (3D) motion capture video sequences. The proposed feature descriptor is extended from the Boolean features which have been successfully used in computer animation. We first transform 3D coordinates of specified human points, as provided by the motion capture data system, into corresponding 3D points as defined in an articulated 3D human model. We then derive novel 3D geometric relational features, a numeric (continuous-valued) version of the Boolean features, to represent the geometric relations among body points of a pose. Finally, the proposed feature descriptor is applied in human action classification using the hidden Markov model. The simulation results indicate the effectiveness of the proposed feature descriptor as evidenced by the high recognition rate.

[1]  Michalis E. Zervakis,et al.  A survey of video processing techniques for traffic applications , 2003, Image Vis. Comput..

[2]  Wei-Yang Lin,et al.  Recognizing Human Actions Using NWFE-Based Histogram Vectors , 2010, EURASIP J. Adv. Signal Process..

[3]  Jürgen Schmidhuber,et al.  Novel Kernel-Based Recognizers of Human Actions , 2010, EURASIP J. Adv. Signal Process..

[4]  Michael J. Black,et al.  HumanEva: Synchronized Video and Motion Capture Dataset for Evaluation of Articulated Human Motion , 2006 .

[5]  Tieniu Tan,et al.  Recent developments in human motion analysis , 2003, Pattern Recognit..

[6]  Moon Gi Kang,et al.  Region Adaptive Color Demosaicing Algorithm Using Color Constancy , 2010, EURASIP J. Adv. Signal Process..

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  Rama Chellappa,et al.  Machine Recognition of Human Activities: A Survey , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Kuo-Chin Fan,et al.  Efficient Human Action and Gait Analysis Using Multiresolution Motion Energy Histogram , 2010, EURASIP J. Adv. Signal Process..

[10]  Jenq-Neng Hwang,et al.  Real-Time 3D Human Pose Estimation from Monocular View with Applications to Event Detection and Video Gaming , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[11]  Tido Röder,et al.  Efficient content-based retrieval of motion capture data , 2005, SIGGRAPH 2005.

[12]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[13]  Tido Röder Similarity, retrieval, and classification of motion capture data , 2006 .

[14]  Datong Chen,et al.  Infrastructure for Machine Understanding of Video Observations in Skilled Care Facilities – Implications of Early Results from CareMedia Case Studies , 2004 .

[15]  Sergio A. Velastin,et al.  Intelligent distributed surveillance systems: a review , 2005 .

[16]  Jaeho Lee,et al.  Human Action Recognition Using Ordinal Measure of Accumulated Motion , 2010, EURASIP J. Adv. Signal Process..

[17]  Jenq-Neng Hwang,et al.  View-invariant 3D human body pose reconstruction using a monocular video camera , 2011, 2011 Fifth ACM/IEEE International Conference on Distributed Smart Cameras.

[18]  Junji Yamato,et al.  Recognizing human action in time-sequential images using hidden Markov model , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.