Motion analysis: Action detection, recognition and evaluation based on motion capture data

Abstract A novel framework, for real-time action detection, recognition and evaluation of motion capture data, is presented in this paper. Pose and kinematics information is used for data description. Automatic and dynamic weighting, altering joint data significance based on action involvement, and Kinetic energy-based descriptor sampling are employed for efficient action segmentation and labelling. The automatically segmented and recognized action instances are subsequently fed to the framework action evaluation component, which compares them with the corresponding reference ones, estimating their similarity. Exploiting fuzzy logic, the framework subsequently gives semantic feedback with instructions on performing the actions more accurately. Experimental results on MSR-Action3D and MSRC12 benchmarking datasets and a new, publicly available one, provide evidence that the proposed framework compares favourably to state-of-the-art methods by 0.5–6% in all three datasets, showing that the proposed method can be effectively used for unsupervised gesture/action training.

[1]  Jake K. Aggarwal,et al.  View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[2]  Anuj Srivastava,et al.  Action Recognition Using Rate-Invariant Analysis of Skeletal Shape Trajectories , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Ioannis A. Kakadiaris,et al.  A Review of Human Activity Recognition Methods , 2015, Front. Robot. AI.

[5]  Mohan M. Trivedi,et al.  Joint Angles Similarities and HOG2 for Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[6]  Haitham Bou-Ammar,et al.  Factored four way conditional restricted Boltzmann machines for activity recognition , 2015, Pattern Recognit. Lett..

[7]  Helena M. Mentis,et al.  Instructing people for training gestural interactive systems , 2012, CHI.

[8]  Taku Komura,et al.  A Virtual Reality Dance Training System Using Motion Capture Technology , 2011, IEEE Transactions on Learning Technologies.

[9]  Jon Bentley,et al.  Programming pearls: algorithm design techniques , 1984, CACM.

[10]  Petros Daras,et al.  Estimating human motion from multiple Kinect sensors , 2013, MIRAGE '13.

[11]  Youssef Chahir,et al.  Spatiotemporal representation of 3D skeleton joints-based action recognition using modified spherical harmonics , 2016, Pattern Recognit. Lett..

[12]  Wanqing Li,et al.  Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[13]  Adrien Chan-Hon-Tong,et al.  Simultaneous segmentation and classification of human actions in video streams using deeply optimized Hough transform , 2014, Pattern Recognit..

[14]  Hairong Qi,et al.  Group Sparsity and Geometry Constrained Dictionary Learning for Action Recognition from Depth Maps , 2013, 2013 IEEE International Conference on Computer Vision.

[15]  Amr Sharaf,et al.  Real-Time Multi-scale Action Detection from 3D Skeleton Data , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[16]  Alberto Del Bimbo,et al.  Combined shape analysis of human poses and motion units for action segmentation and recognition , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[17]  Xin Zhao,et al.  Structured Streaming Skeleton -- A New Feature for Online Human Gesture Recognition , 2014, TOMM.

[18]  Markus H. Gross,et al.  Combining body sensors and visual sensors for motion training , 2005, ACE '05.

[19]  Radha Poovendran,et al.  Group Event Detection With a Varying Number of Group Members for Video Surveillance , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Petros Daras,et al.  Quaternionic Signal Processing Techniques for Automatic Evaluation of Dance Performances From MoCap Data , 2014, IEEE Transactions on Multimedia.

[21]  Guijin Wang,et al.  A novel hierarchical framework for human action recognition , 2016, Pattern Recognit..

[22]  Jessica K. Hodgins,et al.  Interactive control of avatars animated with human motion data , 2002, SIGGRAPH.

[23]  Cristian Sminchisescu,et al.  The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[24]  Petros Maragos,et al.  Multimodal human action recognition in assistive human-robot interaction , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Baining Guo,et al.  Exemplar-Based Human Action Pose Correction , 2014, IEEE Transactions on Cybernetics.

[26]  Moustafa Meshry,et al.  Linear-time online action detection from 3D skeletal data using bags of gesturelets , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[27]  Eshed Ohn-Bar,et al.  Joint Angles Similiarities and HOG 2 for Action Recognition , 2013 .

[28]  Sebastian Nowozin,et al.  Action Points: A Representation for Low-latency Online Human Action Recognition , 2012 .

[29]  Lasitha Piyathilaka,et al.  Human Activity Recognition for Domestic Robots , 2013, FSR.

[30]  Marwan Torki,et al.  Human Action Recognition Using a Temporal Hierarchy of Covariance Descriptors on 3D Joint Locations , 2013, IJCAI.

[31]  H Ghasemzadeh,et al.  Coordination Analysis of Human Movements With Body Sensor Networks: A Signal Processing Model to Evaluate Baseball Swings , 2011, IEEE Sensors Journal.

[32]  Howard Leung,et al.  Interactive dancing game with real-time recognition of continuous dance moves from 3D human motion capture , 2011, ICUIMC '11.

[33]  Gérard G. Medioni,et al.  Structured Time Series Analysis for Human Action Segmentation and Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Srinivas Akella,et al.  3D human action segmentation and recognition using pose kinetic energy , 2014, 2014 IEEE International Workshop on Advanced Robotics and its Social Impacts.

[35]  Joaquim A. Jorge,et al.  Computer-assisted rehabilitation: towards effective evaluation , 2013 .