A learning-based markerless approach for full-body kinematics estimation in-natura from a single image.

We present a supervised machine learning approach for markerless estimation of human full-body kinematics for a cyclist from an unconstrained colour image. This approach is motivated by the limitations of existing marker-based approaches restricted by infrastructure, environmental conditions, and obtrusive markers. By using a discriminatively learned mixture-of-parts model, we construct a probabilistic tree representation to model the configuration and appearance of human body joints. During the learning stage, a Structured Support Vector Machine (SSVM) learns body parts appearance and spatial relations. In the testing stage, the learned models are employed to recover body pose via searching in a test image over a pyramid structure. We focus on the movement modality of cycling to demonstrate the efficacy of our approach. In natura estimation of cycling kinematics using images is challenging because of human interaction with a bicycle causing frequent occlusions. We make no assumptions in relation to the kinematic constraints of the model, nor the appearance of the scene. Our technique finds multiple quality hypotheses for the pose. We evaluate the precision of our method on two new datasets using loss functions. Our method achieves a score of 91.1 and 69.3 on mean Probability of Correct Keypoint (PCK) measure and 88.7 and 66.1 on the Average Precision of Keypoints (APK) measure for the frontal and sagittal datasets respectively. We conclude that our method opens new vistas to robust user-interaction free estimation of full body kinematics, a prerequisite to motion analysis.

[1]  Vicente Mata,et al.  Kinematic description of soft tissue artifacts: quantifying rigid versus deformation components and their relation with bone motion , 2012, Medical & Biological Engineering & Computing.

[2]  W. Taylor,et al.  A survey of formal methods for determining the centre of rotation of ball joints. , 2006, Journal of biomechanics.

[3]  Yi Yang,et al.  Articulated Human Detection with Flexible Mixtures of Parts , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  R. Riemer,et al.  Uncertainties in inverse dynamics solutions: a comprehensive analysis and an application to gait. , 2008, Gait & posture.

[5]  Silvio Savarese,et al.  An efficient branch-and-bound algorithm for optimal human pose estimation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Ankur Agarwal,et al.  Recovering 3D human pose from monocular images , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Silvia Fantozzi,et al.  Effectiveness of an automatic tracking software in underwater motion analysis. , 2013, Journal of sports science & medicine.

[8]  Hartmut Witte,et al.  ISB recommendation on definitions of joint coordinate system of various joints for the reporting of human joint motion--part I: ankle, hip, and spine. International Society of Biomechanics. , 2002, Journal of biomechanics.

[9]  Thomas P Andriacchi,et al.  Posturographic analysis through markerless motion capture without ground reaction forces measurement. , 2009, Journal of biomechanics.

[10]  Aurelio Cappozzo,et al.  Metrics for Describing Soft-Tissue Artefact and Its Effect on Pose, Size, and Shape of Marker Clusters , 2014, IEEE Transactions on Biomedical Engineering.

[11]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[12]  B. Galna,et al.  Quantification of soft tissue artifact in lower limb human motion analysis: a systematic review. , 2010, Gait & posture.

[13]  Bernt Schiele,et al.  2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Olga Veksler,et al.  Star Shape Prior for Graph-Cut Image Segmentation , 2008, ECCV.

[15]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Tine Alkjær,et al.  Markerless motion capture systems for tracking of persons in forensic biomechanics: an overview , 2014, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[17]  M O Heller,et al.  Repeatability and reproducibility of OSSCA, a functional approach for assessing the kinematics of the lower limb. , 2010, Gait & posture.

[18]  Tron Krosshaug,et al.  A model-based image-matching technique for three-dimensional reconstruction of human motion from uncalibrated video sequences. , 2005, Journal of biomechanics.

[19]  Peter V. Gehler,et al.  Human Pose Estimation with Fields of Parts , 2014, ECCV.

[20]  Bryan Buchholz,et al.  ISB recommendation on definitions of joint coordinate systems of various joints for the reporting of human joint motion--Part II: shoulder, elbow, wrist and hand. , 2005, Journal of biomechanics.

[21]  Lorenzo Chiari,et al.  Human movement analysis using stereophotogrammetry. Part 4: assessment of anatomical landmark misplacement and its effects on joint kinematics. , 2005, Gait & posture.

[22]  Hans-Peter Seidel,et al.  A data-driven approach for real-time full body pose reconstruction from a depth camera , 2011, 2011 International Conference on Computer Vision.

[23]  A. Cappello,et al.  Soft tissue artefact assessment in humeral axial rotation. , 2004, Gait & posture.

[24]  J. Challis A procedure for determining rigid body transformation parameters. , 1995, Journal of biomechanics.

[25]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[26]  Scott Tashman,et al.  The inaccuracy of surface-measured model-derived tibiofemoral kinematics. , 2012, Journal of biomechanics.

[27]  Claudio Cobelli,et al.  Markerless analysis of front crawl swimming. , 2011, Journal of biomechanics.

[28]  Cordelia Schmid,et al.  Mixing Body-Part Sequences for Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Peter V. Gehler,et al.  Poselet Conditioned Pictorial Structures , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Stefano Corazza,et al.  A framework for the functional identification of joint centers using markerless motion capture, validation for the hip joint. , 2007, Journal of biomechanics.

[32]  Deva Ramanan,et al.  Detecting Actions, Poses, and Objects with Relational Phraselets , 2012, ECCV.

[33]  A. Cappozzo,et al.  Human movement analysis using stereophotogrammetry. Part 1: theoretical background. , 2005, Gait & posture.

[34]  Pascal Fua,et al.  Surface Deformation Models for Nonrigid 3D Shape Recovery , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Henrik Aanæs,et al.  Markerless motion capture can provide reliable 3D gait kinematics in the sagittal and frontal plane. , 2014, Medical engineering & physics.

[36]  A. Cappozzo,et al.  Human movement analysis using stereophotogrammetry. Part 3. Soft tissue artifact assessment and compensation. , 2005, Gait & posture.

[37]  Larry S. Davis,et al.  Context and observation driven latent variable model for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Andrew Zisserman,et al.  2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images , 2012, International Journal of Computer Vision.

[39]  Xiaogang Wang,et al.  Structured Feature Learning for Pose Estimation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Aurelio Cappozzo,et al.  Non-invasive assessment of superficial soft tissue local displacements during movement: a feasibility study. , 2009, Journal of biomechanics.

[41]  Alan L. Yuille,et al.  Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations , 2014, NIPS.

[42]  J J O'Connor,et al.  Bone position estimation from skin marker co-ordinates using global optimisation with joint constraints. , 1999, Journal of biomechanics.

[43]  Joseph J Crisco,et al.  Kinematic differences between optical motion capture and biplanar videoradiography during a jump-cut maneuver. , 2013, Journal of biomechanics.

[44]  Bernt Schiele,et al.  Pictorial structures revisited: People detection and articulated pose estimation , 2009, CVPR.

[45]  H. Hatze The fundamental problem of myoskeletal inverse dynamics and its implications. , 2002, Journal of biomechanics.

[46]  C. Cobelli,et al.  A Markerless Motion Capture System to Study Musculoskeletal Biomechanics: Visual Hull and Simulated Annealing Approach , 2006, Annals of Biomedical Engineering.

[47]  R Dumas,et al.  A model of the soft tissue artefact rigid component. , 2015, Journal of biomechanics.

[48]  Vittorio Ferrari,et al.  Better Appearance Models for Pictorial Structures , 2009, BMVC.

[49]  E Van Praagh,et al.  Leg cycling tracking by dynamic vision. , 1997, Journal of biomechanics.

[50]  Michael Damsgaard,et al.  Do kinematic models reduce the effects of soft tissue artefacts in skin marker-based motion analysis? An in vivo study of knee kinematics. , 2010, Journal of biomechanics.

[51]  Andrew Zisserman,et al.  Efficient discriminative learning of parts-based models , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[52]  Tomohiro Gonjo,et al.  Reliability of Three-Dimensional Angular Kinematics and Kinetics of Swimming Derived from Digitized Video. , 2016, Journal of sports science & medicine.

[53]  Hanna Schell,et al.  On the influence of soft tissue coverage in the determination of bone kinematics using skin markers , 2005, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[54]  Jean-Yves Guillemaut,et al.  Athlete Pose Estimation from Monocular TV Sports Footage , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.