Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training

Using standard digital cameras in combination with deep learning (DL) for pose estimation is promising for the in-home and independent use of exercise games (exergames). We need to investigate to what extent such DL-based systems can provide satisfying accuracy on exergame relevant measures. Our study assesses temporal variation (i.e., variability) in body segment lengths, while using a Deep Learning image processing tool (DeepLabCut, DLC) on two-dimensional (2D) video. This variability is then compared with a gold-standard, marker-based three-dimensional Motion Capturing system (3DMoCap, Qualisys AB), and a 3D RGB-depth camera system (Kinect V2, Microsoft Inc). Simultaneous data were collected from all three systems, while participants (N = 12) played a custom balance training exergame. The pose estimation DLC-model is pre-trained on a large-scale dataset (ImageNet) and optimized with context-specific pose annotated images. Wilcoxon’s signed-rank test was performed in order to assess the statistical significance of the differences in variability between systems. The results showed that the DLC method performs comparably to the Kinect and, in some segments, even to the 3DMoCap gold standard system with regard to variability. These results are promising for making exergames more accessible and easier to use, thereby increasing their availability for in-home exercise.

[1]  Jan Stegenga,et al.  Suitability of Kinect for measuring whole body movement patterns during exergaming. , 2014, Journal of biomechanics.

[2]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Song-Chun Zhu,et al.  Understanding tools: Task-oriented object modeling, learning and recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  A. Tiedemann,et al.  Exercise for preventing falls in older people living in the community. , 2016, The Cochrane database of systematic reviews.

[5]  Yoichi Iino,et al.  Evaluation of 3D Markerless Motion Capture Accuracy Using OpenPose With Multiple Video Cameras , 2019, bioRxiv.

[6]  Gabi Zeilig,et al.  Therapy for Stroke Rehabilitation Eliciting Upper Extremity Purposeful Movements Using Video Games : A Comparison With Traditional , 2014 .

[7]  Kevin M. Cury,et al.  DeepLabCut: markerless pose estimation of user-defined body parts with deep learning , 2018, Nature Neuroscience.

[8]  Ioannis A. Kakadiaris,et al.  3D Human pose estimation: A review of the literature and analysis of covariates , 2016, Comput. Vis. Image Underst..

[9]  Raymond W. McGorry,et al.  The validity of the first and second generation Microsoft Kinect™ for identifying joint center locations during static postures. , 2015, Applied ergonomics.

[10]  Beatrix Vereijken,et al.  Exercise and rehabilitation delivered through exergames in older adults: An integrative review of technologies, safety and efficacy , 2016, Int. J. Medical Informatics.

[11]  Pascal Fua,et al.  XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera , 2019, ACM Trans. Graph..

[12]  Alexander Mathis,et al.  A Primer on Motion Capture with Deep Learning: Principles, Pitfalls, and Perspectives , 2020, Neuron.

[13]  N. A. Borghese,et al.  Usability and Effects of an Exergame-Based Balance Training Program. , 2014, Games for health journal.

[14]  Jan Stegenga,et al.  Exergaming for balance training of elderly: state of the art and future developments , 2013, Journal of NeuroEngineering and Rehabilitation.

[15]  TeichriebVeronica,et al.  Motor Rehabilitation Using Kinect: A Systematic Review , 2015 .

[16]  Wiebren Zijlstra,et al.  A systematic review of gait perturbation paradigms for improving reactive stepping responses and falls risk among healthy older adults , 2017, European Review of Aging and Physical Activity.

[17]  Yichen Wei,et al.  Integral Human Pose Regression , 2017, ECCV.

[18]  Heri Ramampiaro,et al.  EfficientPose: Scalable single-person pose estimation , 2020, ArXiv.

[19]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[20]  T E Howe,et al.  Exercise for improving balance in older people. , 2007, The Cochrane database of systematic reviews.

[21]  Marco Morana,et al.  Human Activity Recognition Process Using 3-D Posture Data , 2015, IEEE Transactions on Human-Machine Systems.

[22]  A KakadiarisIoannis,et al.  3D Human pose estimation , 2016 .

[23]  Stepán Obdrzálek,et al.  Accuracy and robustness of Kinect pose estimation in the context of coaching of elderly population , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[24]  B Bonnechère,et al.  Validity and reliability of the Kinect within functional assessment activities: comparison with standard stereophotogrammetry. , 2014, Gait & posture.

[25]  David E Bloom,et al.  Towards a comprehensive public health response to population ageing , 2015, The Lancet.

[26]  D. Reisman,et al.  Observation of amounts of movement practice provided during stroke rehabilitation. , 2009, Archives of physical medicine and rehabilitation.

[27]  Thomas P Andriacchi,et al.  The evolution of methods for the capture of human movement leading to markerless motion capture for biomechanical applications , 2006, Journal of NeuroEngineering and Rehabilitation.

[28]  Mei-Hsiang Chen,et al.  Developing a Digital Game for Stroke Patients’ Upper Extremity Rehabilitation – Design, Usability and Effectiveness Assessment , 2015 .

[29]  D G Lloyd,et al.  Joint kinematic calculation based on clinical direct kinematic versus inverse kinematic gait models. , 2016, Journal of biomechanics.

[30]  Ilona J M de Rooij,et al.  Effect of Virtual Reality Training on Balance and Gait Ability in Patients With Stroke: Systematic Review and Meta-Analysis , 2016, Physical Therapy.

[31]  Yingli Tian,et al.  Monocular human pose estimation: A survey of deep learning-based methods , 2020, Comput. Vis. Image Underst..

[32]  Lorenzo Chiari,et al.  Human movement analysis using stereophotogrammetry. Part 4: assessment of anatomical landmark misplacement and its effects on joint kinematics. , 2005, Gait & posture.

[33]  Cristian Sminchisescu,et al.  Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Nassir Navab,et al.  Motor Rehabilitation Using Kinect: A Systematic Review. , 2015, Games for health journal.

[36]  Igor Tak,et al.  Validity of a New 3-D Motion Analysis Tool for the Assessment of Knee, Hip and Spine Joint Angles during the Single Leg Squat , 2020, Sensors.

[37]  Rachel Proffitt,et al.  Moving the Field Forward Rehabilitation : Virtual Reality Interventions for Stroke Considerations in the Efficacy and Effectiveness of , 2015 .

[38]  John E. Angus,et al.  Application of extended Kalman filter for improving the accuracy and smoothness of Kinect skeleton-joint estimates , 2014 .

[39]  Marjorie Skubic,et al.  Validation of a Kinect V2 based rehabilitation game , 2018, PloS one.

[40]  Luigi Raffo,et al.  Functional estimation of bony segment lengths using magneto-inertial sensing: Application to the humerus , 2018, PloS one.

[41]  Bernt Schiele,et al.  DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model , 2016, ECCV.

[42]  Luca Romeo,et al.  Accuracy evaluation of the Kinect v2 sensor during dynamic movements in a rehabilitation scenario , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[43]  Andrew Zisserman,et al.  Exploiting Temporal Context for 3D Human Pose Estimation in the Wild , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Julius Verrel,et al.  Accuracy and Reliability of the Kinect Version 2 for Clinical Measurement of Motor Function , 2016, PloS one.

[45]  Mohamed A. Elgharib,et al.  XNect , 2020 .