Real-time human action recognition from motion capture data

Recognition of human actions is one of the important tasks in various computer vision applications including video surveillance, human computer interaction etc. Traditionally RGB or depth cameras are utilized for this task. In this work we propose an approach that utilizes motion capture data for recognizing actions. Motion capture provides accurate motion information of joints of body in 3D space. The 3D skeleton joint co-ordinates of the user provided by motion capture system are used to analyze the dynamics of the action being performed. The temporal variance of each joint of the skeleton and its time weighted variance serve as the features for classification. The time weighted variance feature embeds temporal information in the feature and helps in discriminating confusing actions such as sit-down and stand-up. These features can be extracted rapidly and suitable for real-time recognition. We demonstrate the performance of the proposed approach using correlation based metric and support vector machines (SVM) on the Multi-modal Human Action Detection dataset. The recognition accuracy of above 95% has been achieved.

[1]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3]  Ruzena Bajcsy,et al.  Berkeley MHAD: A comprehensive Multimodal Human Action Database , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).

[4]  Markus Koskela,et al.  Classification of RGB-D and Motion Capture Sequences Using Extreme Learning Machine , 2013, SCIA.

[5]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[6]  Ruzena Bajcsy,et al.  Sequence of the Most Informative Joints (SMIJ): A new representation for human skeletal action recognition , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[7]  Mubarak Shah,et al.  Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Du Tran,et al.  Human Activity Recognition with Metric Learning , 2008, ECCV.

[9]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.