Explaining Machine Learning Models for Clinical Gait Analysis

Machine learning (ML) is increasingly used to support decision-making in the healthcare sector. While ML approaches provide promising results with regard to their classification performance, most share a central limitation, namely their black-box character. Motivated by the interest to understand the functioning of ML models, methods from the field of Explainable Artificial Intelligence (XAI) have recently become important. This article investigates the usefulness of XAI methods in clinical gait classification. For this purpose, predictions of state-of-the-art classification methods are explained with an established XAI method, i.e., Layer-wise Relevance Propagation (LRP). We propose to evaluate the obtained explanations with two complementary approaches: a statistical analysis of the underlying data using Statistical Parametric Mapping and a qualitative evaluation by a clinical expert. A gait dataset comprising ground reaction force measurements from 132 patients with different lower-body gait disorders and 62 healthy controls is utilized. We investigate several gait classification tasks, employ multiple classification methods, and analyze the impact of data normalization and different signal components for classification performance and explanation quality. Our experiments show that explanations obtained by LRP exhibit promising statistical properties concerning inter-class discriminativity and are also in line with clinically relevant biomechanical gait characteristics.

[1]  Avanti Shrikumar,et al.  Learning Important Features Through Propagating Activation Differences , 2017, ICML.

[2]  Christian Biemann,et al.  What do we need to build explainable AI systems for the medical domain? , 2017, ArXiv.

[3]  François Chollet,et al.  Deep Learning with Python , 2017 .

[4]  Carlos Guestrin,et al.  Model-Agnostic Interpretability of Machine Learning , 2016, ArXiv.

[5]  Georg Langs,et al.  Causability and explainability of artificial intelligence in medicine , 2019, WIREs Data Mining Knowl. Discov..

[6]  Amina Adadi,et al.  Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) , 2018, IEEE Access.

[7]  Hailong Zhu,et al.  Support vector machine for classification of walking conditions of persons after stroke with dropped foot. , 2009, Human movement science.

[8]  Markus Wagner,et al.  KAVAGait: Knowledge-Assisted Visual Analytics for Clinical Gait Analysis , 2017, IEEE Transactions on Visualization and Computer Graphics.

[9]  Max Welling,et al.  Visualizing Deep Neural Network Decisions: Prediction Difference Analysis , 2017, ICLR.

[10]  Wojciech Samek,et al.  Toward Interpretable Machine Learning: Transparent Deep Neural Networks and Beyond , 2020, ArXiv.

[11]  Klaus-Robert Müller,et al.  Layer-Wise Relevance Propagation: An Overview , 2019, Explainable AI.

[12]  Bing Yu,et al.  Effect of dual task on gait asymmetry in patients after anterior cruciate ligament reconstruction , 2018, Scientific Reports.

[13]  Motoaki Kawanabe,et al.  How to Explain Individual Classification Decisions , 2009, J. Mach. Learn. Res..

[14]  Thompson Sarkodie-Gyan,et al.  Automatic classification of pathological gait patterns using ground reaction forces and machine learning algorithms , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[15]  Amit Dhurandhar,et al.  One Explanation Does Not Fit All: A Toolkit and Taxonomy of AI Explainability Techniques , 2019, ArXiv.

[16]  Brian Horsak,et al.  Input representations and classification strategies for automated human gait analysis. , 2019, Gait & posture.

[17]  Todd C Pataky,et al.  Generalized n-dimensional biomechanical field analysis using statistical parametric mapping. , 2010, Journal of biomechanics.

[18]  W I Schöllhorn,et al.  Applications of artificial neural nets in clinical biomechanics. , 2004, Clinical biomechanics.

[19]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[20]  Trevor Darrell,et al.  Generating Visual Explanations , 2016, ECCV.

[21]  Brian Horsak,et al.  GaitRec, a large-scale ground reaction force dataset of healthy and impaired gait , 2020, Scientific Data.

[22]  Jan Sijbers,et al.  STAPP: Spatiotemporal analysis of plantar pressure measurements using statistical parametric mapping. , 2018, Gait & posture.

[23]  Z. Hasan A Survey on Shari’Ah Governance Practices in Malaysia, GCC Countries and the UK , 2011 .

[24]  Vinzenz von Tscharner,et al.  Gait patterns of asymmetric ankle osteoarthritis patients. , 2012, Clinical biomechanics.

[25]  Klaus-Robert Müller,et al.  Explaining the unique nature of individual gait patterns with deep learning , 2018, Scientific Reports.

[26]  David S. Melnick,et al.  International evaluation of an AI system for breast cancer screening , 2020, Nature.

[27]  Kaat Desloovere,et al.  Statistical Parametric Mapping to Identify Differences between Consensus-Based Joint Patterns during Gait in Children with Cerebral Palsy , 2017, PloS one.

[28]  Kaat Desloovere,et al.  Probabilistic gait classification in children with cerebral palsy: a Bayesian approach. , 2011, Research in developmental disabilities.

[29]  Jürgen Bernard,et al.  FuryExplorer: visual-interactive exploration of horse motion capture data , 2015, Electronic Imaging.

[30]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[31]  Jacek M. Zurada,et al.  Sensitivity analysis for minimization of input data dimension for feedforward neural network , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.

[32]  Brian McWilliams,et al.  The Shattered Gradients Problem: If resnets are the answer, then what is the question? , 2017, ICML.

[33]  Thomas Brox,et al.  Synthesizing the preferred inputs for neurons in neural networks via deep generator networks , 2016, NIPS.

[34]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[35]  Eric J Topol,et al.  High-performance medicine: the convergence of human and artificial intelligence , 2019, Nature Medicine.

[36]  Todd C. Pataky,et al.  One-dimensional statistical parametric mapping in Python , 2012, Computer methods in biomechanics and biomedical engineering.

[37]  Madalina Fiterau,et al.  Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunities. , 2018, Journal of biomechanics.

[38]  Angkoon Phinyomark,et al.  Analysis of Big Data in Gait Biomechanics: Current Trends and Future Directions , 2017, Journal of Medical and Biological Engineering.

[39]  Arnold Baca,et al.  Automatic Classification of Functional Gait Disorders , 2017, IEEE Journal of Biomedical and Health Informatics.

[40]  Saman K. Halgamuge,et al.  Classification of Parkinson's Disease Gait Using Spatial-Temporal Gait Features , 2015, IEEE Journal of Biomedical and Health Informatics.

[41]  Alexander Binder,et al.  Evaluating the Visualization of What a Deep Neural Network Has Learned , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[42]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[43]  Been Kim,et al.  Sanity Checks for Saliency Maps , 2018, NeurIPS.

[44]  T Chau,et al.  A review of analytical techniques for gait data. Part 1: Fuzzy, statistical and fractal methods. , 2001, Gait & posture.

[45]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[46]  Cuntai Guan,et al.  A Survey on Explainable Artificial Intelligence (XAI): Toward Medical XAI , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[47]  Shinichi Nakajima,et al.  Towards Best Practice in Explaining Neural Network Decisions with LRP , 2019, 2020 International Joint Conference on Neural Networks (IJCNN).

[48]  Klaus-Robert Müller,et al.  Explainable artificial intelligence , 2017 .

[49]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[50]  Alexander Binder,et al.  Unmasking Clever Hans predictors and assessing what machines really learn , 2019, Nature Communications.

[51]  Patrik Kutilek,et al.  Variability of centre of pressure movement during gait in young and middle-aged women. , 2014, Gait & posture.

[52]  Richard Baker Measuring Walking: A Handbook of Clinical Gait Analysis , 2013 .

[53]  Joana Figueiredo,et al.  Automatic recognition of gait patterns in human motor disorders using machine learning: A review. , 2018, Medical engineering & physics.

[54]  Alexander Binder,et al.  On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation , 2015, PloS one.

[55]  Jie Xu,et al.  The practical implementation of artificial intelligence technologies in medicine , 2019, Nature Medicine.

[56]  Sven Giesselbach,et al.  Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning , 2020, Frontiers in Bioengineering and Biotechnology.

[57]  Wojciech Samek,et al.  Methods for interpreting and understanding deep neural networks , 2017, Digit. Signal Process..

[58]  Andrea Vedaldi,et al.  Interpretable Explanations of Black Boxes by Meaningful Perturbation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[59]  A Baca,et al.  P 011-Towards an optimal combination of input signals and derived representations for gait classification based on ground reaction force measurements. , 2018, Gait & posture.

[60]  H. Haenssle,et al.  Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists , 2018, Annals of oncology : official journal of the European Society for Medical Oncology.

[61]  Ralf Mikut,et al.  Automated feature assessment in instrumented gait analysis. , 2006, Gait & posture.