Assessment and prediction of spine surgery invasiveness with machine learning techniques

BACKGROUND The interest in Minimally Invasive Surgery (MIS) techniques has greatly increased in the recent years due to their significant advantages, both in terms of outcome improvement and cost reduction. Also in spine surgery, MIS is now applicable to several conditions and, above all, in low back pain (LBP) treatment. However, reliable and objective measures of invasiveness, necessary to compare different procedures, are still lacking. METHODS In this article we study the application of Machine Learning (ML) techniques to define an invasiveness score for LBP procedures based on biological markers and inflammatory profiles. In so doing, we can assess the invasiveness of surgical procedures. We also propose a predictive model for treatment planning based on the evaluation of invasiveness of surgical alternatives for specific patients, using their pre-surgery biomarkers. The data used in study was characterized by low sample size and high-dimensionality, thus we adopted a combination of feature selection, careful selection of ML models and conservative model selection choices in order to address these concerns. We also performed an external validation based on a statistically significantly different datasets in order to confirm the relevance of the findings. RESULTS We report the results of an experimental study on real-world data, for which we obtained promising results for both considered applications: we report an AUC of 0.87 for the task of invasiveness score definition, and an AUC of 0.76 for the invasiveness prediction task. The results obtained on the external validation were in agreement with the obtained results. Further, in both cases the performances were considered as excellent by the involved clinicians and the selected predictive features were biologically relevant and associated with invasiveness and biological impact in the relevant literature. CONCLUSION Our results show that ML techniques could be effectively employed not only for diagnosis or prognosis, but also for treatment planning, a task of fundamental importance toward personalized and value-based healthcare. These results also show that ML approaches could be effectively used even in scenarios (e.g. pilot studies) where only small samples are available.

[1]  Robert P. W. Duin,et al.  Stabilizing classifiers for very small sample sizes , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[2]  Bernadette A. Thomas,et al.  Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[3]  Gail M. Williams,et al.  A systematic review of the global prevalence of low back pain. , 2012, Arthritis and rheumatism.

[4]  M. Porter,et al.  What is value in health care? , 2010, The New England journal of medicine.

[5]  Justin K Scheer,et al.  Development and Validation of a Novel Adult Spinal Deformity Surgical Invasiveness Score: Analysis of 464 Patients , 2018, Neurosurgery.

[6]  Alan H. Daniels,et al.  Predictive Modeling for Blood Transfusion After Adult Spinal Deformity Surgery: A Tree-Based Machine Learning Approach , 2018, Spine.

[7]  P. Berjano,et al.  Minimally invasive PLIF with divergent, cortical trajectory pedicle screws , 2015, European Spine Journal.

[8]  Michael Y. Wang,et al.  Economic impact of minimally invasive lumbar surgery. , 2015, World journal of orthopedics.

[9]  J. Villafañe,et al.  Complications in adult spine deformity surgery: a systematic review of the recent literature with reporting of aggregated incidences , 2018, European Spine Journal.

[10]  E. Ashley Towards precision medicine , 2016, Nature Reviews Genetics.

[11]  Alexandru G. Floares,et al.  Using Computational Intelligence to Develop Intelligent Clinical Decision Support Systems , 2009, CIBB.

[12]  R. Mobbs,et al.  Lumbar interbody fusion: techniques, indications and comparison of interbody fusion options including PLIF, TLIF, MI-TLIF, OLIF/ATP, LLIF and ALIF. , 2015, Journal of spine surgery.

[13]  Kyung-Soo Suk,et al.  The Quantitative Analysis of Tissue Injury Markers After Mini-Open Lumbar Fusion , 2006, Spine.

[14]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[15]  Martin Underwood,et al.  Non-specific low back pain , 2012, The Lancet.

[16]  A. Ng,et al.  Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet , 2018, PLoS medicine.

[17]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[18]  Davide Anguita,et al.  Model selection for support vector machines: Advantages and disadvantages of the Machine Learning Theory , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[19]  Lorri A. Lee,et al.  Development of an Index to Characterize the “Invasiveness” of Spine Surgery: Validation by Comparison to Blood Loss and Operative Time , 2008, Spine.

[20]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[21]  G. Andersson Epidemiological features of chronic low-back pain , 1999, The Lancet.

[22]  E. Tessitore,et al.  Extreme lateral interbody fusion (XLIF®): how I do it , 2015, Acta Neurochirurgica.

[23]  Nico Karssemeijer,et al.  Large scale deep learning for computer aided detection of mammographic lesions , 2017, Medical Image Anal..

[24]  Yoshua Bengio,et al.  Model Selection for Small Sample Regression , 2002, Machine Learning.

[25]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[26]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[27]  Shawn D. Pethel,et al.  Exact Test of Independence Using Mutual Information , 2014, Entropy.

[28]  R A Deyo,et al.  Outcome Measures for Low Back Pain Research: A Proposal for Standardized Use , 1998, Spine.

[29]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[30]  Michael Y. Wang,et al.  An analysis of postoperative thigh symptoms after minimally invasive transpsoas lumbar interbody fusion. , 2011, Journal of neurosurgery. Spine.

[31]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[32]  G. Banfi,et al.  Is Minimally Invasive Spine Surgery Also Minimally Pro-Inflammatory? Muscular Markers, Inflammatory Parameters and Cytokines to Quantify the Operative Invasiveness Assessment in Spine Fusion , 2014 .

[33]  G. Collins,et al.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): The TRIPOD Statement , 2015, Annals of Internal Medicine.

[34]  R. Zamar,et al.  A multivariate Kolmogorov-Smirnov test of goodness of fit , 1997 .

[35]  A. Fassa,et al.  Prevalence of chronic low back pain: systematic review , 2015, Revista de saude publica.

[36]  K. Luk,et al.  A Machine Learning-based Surface Electromyography Topography Evaluation for Prognostic Prediction of Functional Restoration Rehabilitation in Chronic Low Back Pain , 2017, Spine.

[37]  Elizabeth Salt,et al.  A Description and Comparison of Treatments for Low Back Pain in the United States , 2016, Orthopedic nursing.

[38]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  D. G. Bounds,et al.  A multilayer perceptron network for the diagnosis of low back pain , 1988, IEEE 1988 International Conference on Neural Networks.

[40]  Michael E. Tipping Sparse Bayesian Learning and the Relevance Vector Machine , 2001, J. Mach. Learn. Res..

[41]  R. Deyo,et al.  Expenditures and health status among adults with back and neck problems. , 2008, JAMA.

[42]  David Barber,et al.  Detecting Low Back Pain from Clinical Narratives Using Machine Learning Approaches , 2018, DEXA Workshops.

[43]  Z. Obermeyer,et al.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. , 2016, The New England journal of medicine.

[44]  Kevin A. Johnson,et al.  Multivariate classification of structural MRI data detects chronic low back pain. , 2014, Cerebral cortex.

[45]  R. Pietrobon,et al.  Reliability, Validity, and Responsiveness of the Short Form 12-Item Survey (SF-12) in Patients With Back Pain , 2003, Spine.

[46]  C. Lamartina,et al.  Erratum to: Minimally invasive lateral transpsoas approach with advanced neurophysiologic monitoring for lumbar interbody fusion , 2011, European Spine Journal.

[47]  R. Fisher THE FIDUCIAL ARGUMENT IN STATISTICAL INFERENCE , 1935 .

[48]  Tapio Salakoski,et al.  A comparison of AUC estimators in small-sample studies , 2009, MLSB.

[49]  D. Yassa,et al.  Invasiveness Index as a Predictor of Surgical Site Infection after Spinal Fusion, Revision Fusion, or Laminectomy , 2016, Infection Control & Hospital Epidemiology.