A comparative study of pose representation and dynamics modelling for online motion quality assessment

Quantitative assessment of the quality of motion is increasingly in demand by clinicians in healthcare and rehabilitation monitoring of patients. We study and compare the performances of different pose representations and HMM models of dynamics of movement for online quality assessment of human motion. In a general sense, our assessment framework builds a model of normal human motion from skeleton-based samples of healthy individuals. It encapsulates the dynamics of human body pose using robust manifold representation and a first-order Markovian assumption. We then assess deviations from it via a continuous online measure. We compare different feature representations, reduced dimensionality spaces, and HMM models on motions typically tested in clinical settings, such as gait on stairs and flat surfaces, and transitions between sitting and standing. Our dataset is manually labelled by a qualified physiotherapist. The continuous-state HMM, combined with pose representation based on body-joints' location, outperforms standard discrete-state HMM approaches and other skeleton-based features in detecting gait abnormalities, as well as assessing deviations from the motion model on a frame-by-frame basis.

[1]  Luc Van Gool,et al.  Exploiting simple hierarchies for unsupervised human behavior analysis , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Rama Chellappa,et al.  "Shape Activity": a continuous-state HMM for moving/deforming shapes with application to abnormal activity detection , 2005, IEEE Transactions on Image Processing.

[3]  Jake K. Aggarwal,et al.  View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4]  Ehud Rivlin,et al.  Online action recognition using covariance of shape and motion , 2014, Comput. Vis. Image Underst..

[5]  Liang Wang,et al.  Learning and Matching of Dynamic Shape Manifolds for Human Action Recognition , 2007, IEEE Transactions on Image Processing.

[6]  Jesse Hoey,et al.  Automated Detection of Unusual Events on Stairs , 2006, The 3rd Canadian Conference on Computer and Robot Vision (CRV'06).

[7]  Jenq-Neng Hwang,et al.  A Review on Video-Based Human Activity Recognition , 2013, Comput..

[8]  Gérard G. Medioni,et al.  Home Monitoring Musculo-skeletal Disorders with a Single 3D Sensor , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[9]  Luc Van Gool,et al.  Coupled Action Recognition and Pose Estimation from Multiple Views , 2012, International Journal of Computer Vision.

[10]  Maja Pantic,et al.  Combined Support Vector Machines and Hidden Markov Models for Modeling Facial Action Temporal Dynamics , 2007, ICCV-HCI.

[11]  Ahmed M. Elgammal,et al.  The Role of Manifold Learning in Human Motion Analysis , 2006, Human Motion.

[12]  Svetha Venkatesh,et al.  Efficient duration and hierarchical modeling for human activity recognition , 2009, Artif. Intell..

[13]  Carl E. Rasmussen,et al.  Factorial Hidden Markov Models , 1997 .

[14]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[15]  Sudeep Sarkar,et al.  Improved gait recognition by gait dynamics normalization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[17]  Ramakant Nevatia,et al.  Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost , 2006, ECCV.

[18]  Qing Zhang,et al.  A Survey on Human Motion Analysis from Depth Data , 2013, Time-of-Flight and Depth Imaging.

[19]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Ilaria Gori,et al.  Online Action Recognition via Nonparametric Incremental Learning , 2014, BMVC.

[21]  J. F. Yang,et al.  The modified Gait Abnormality Rating Scale for recognizing the risk of recurrent falls in community-dwelling elderly adults. , 1996, Physical therapy.

[22]  Alex Mihailidis,et al.  3D Human Motion Analysis to Detect Abnormal Events on Stairs , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[23]  Paul A. Viola,et al.  Online decoding of Markov models under latency constraints , 2006, ICML.

[24]  Frank Chongwoo Park,et al.  Natural Movement Generation Using Hidden Markov Models and Principal Components , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[25]  Eraldo Ribeiro,et al.  Human Motion Recognition Using Isomap and Dynamic Time Warping , 2007, Workshop on Human Motion.

[26]  Hsuan-Tien Lin,et al.  A note on Platt’s probabilistic outputs for support vector machines , 2007, Machine Learning.

[27]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[28]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  R. Nevatia,et al.  Online, Real-time Tracking and Recognition of Human Actions , 2008, 2008 IEEE Workshop on Motion and video Computing.

[30]  Rama Chellappa,et al.  Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Andreas Wendemuth,et al.  Speech recognition with support vector machines in a hybrid system , 2005, INTERSPEECH.

[32]  Sebastian Nowozin,et al.  Action Points: A Representation for Low-latency Online Human Action Recognition , 2012 .

[33]  Majid Mirmehdi,et al.  Online quality assessment of human motion from skeleton data , 2014, BMVC.

[34]  Toby Sharp,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR.

[35]  Hoang Le Uyen Thuc,et al.  Quasi-periodic action recognition from monocular videos via 3D human models and cyclic HMMs , 2012, The 2012 International Conference on Advanced Technologies for Communications.

[36]  L. Wolfson,et al.  Gait assessment in the elderly: a gait abnormality rating scale and its relation to falls. , 1990, Journal of gerontology.

[37]  Antonio Torralba,et al.  Assessing the Quality of Actions , 2014, ECCV.

[38]  B. Nadler,et al.  Diffusion maps, spectral clustering and reaction coordinates of dynamical systems , 2005, math/0503445.

[39]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[40]  Andrew Zisserman,et al.  Upper Body Pose Estimation with Temporal Sequential Forests , 2014, BMVC.

[41]  Jake K. Aggarwal,et al.  Spatio-temporal Depth Cuboid Similarity Feature for Activity Recognition Using Depth Camera , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Mohan M. Trivedi,et al.  Joint Angles Similarities and HOG2 for Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[43]  Ross T. Whitaker,et al.  Robust non-linear dimensionality reduction using successive 1-dimensional Laplacian Eigenmaps , 2007, ICML '07.

[44]  Kejun Wang,et al.  Video-Based Abnormal Human Behavior Recognition—A Review , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[45]  Tae-Seong Kim,et al.  Continuous Hidden Markov Models for Depth Map-Based Human Activity Recognition , 2011 .

[46]  Ann B. Lee,et al.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Guillermo Sapiro,et al.  Connecting the Out-of-Sample and Pre-Image Problems in Kernel Methods , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Marwan Torki,et al.  Human Action Recognition Using a Temporal Hierarchy of Covariance Descriptors on 3D Joint Locations , 2013, IJCAI.

[49]  Tae-Seong Kim,et al.  Depth video-based gait recognition for smart home using local directional pattern features and hidden Markov model , 2014 .

[50]  Svetha Venkatesh,et al.  Tracking-as-Recognition for Articulated Full-Body Human Motion Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..