Added Value of Intraoperative Data for Predicting Postoperative Complications: Development and Validation of a MySurgeryRisk Extension

To test the hypothesis that accuracy, discrimination, and precision in predicting postoperative complications improve when using both preoperative and intraoperative data input features versus preoperative data alone. Models that predict postoperative complications often ignore important intraoperative physiological changes. Incorporation of intraoperative physiological data may improve model performance. This retrospective cohort analysis included 52,529 inpatient surgeries at a single institution during a 5 year period. Random forest machine learning models in the validated MySurgeryRisk platform made patient-level predictions for three postoperative complications and mortality during hospital admission using electronic health record data and patient neighborhood characteristics. For each outcome, one model trained with preoperative data alone and one model trained with both preoperative and intraoperative data. Models were compared by accuracy, discrimination (expressed as AUROC), precision (expressed as AUPRC), and reclassification indices (NRI). Machine learning models incorporating both preoperative and intraoperative data had greater accuracy, discrimination, and precision than models using preoperative data alone for predicting all three postoperative complications (intensive care unit length of stay >48 hours, mechanical ventilation >48 hours, and neurological complications including delirium) and in-hospital mortality (accuracy: 88% vs. 77%, AUROC: 0.93 vs. 0.87, AUPRC: 0.21 vs. 0.15). Overall reclassification improvement was 2.9-10.0% for complications and 11.2% for in-hospital mortality. Incorporating both preoperative and intraoperative data significantly increased accuracy, discrimination, and precision for machine learning models predicting postoperative complications.

[1]  D. Bertsimas,et al.  Surgical Risk Is Not Linear: Derivation and Validation of a Novel, User-friendly, and Machine-learning-based Predictive OpTimal Trees in Emergency Surgery Risk (POTTER) Calculator , 2018, Annals of surgery.

[2]  C. Ko,et al.  An Examination of American College of Surgeons NSQIP Surgical Risk Calculator Accuracy. , 2017, Journal of the American College of Surgeons.

[3]  Daisy Zhe Wang,et al.  Comparing clinical judgment with the MySurgeryRisk algorithm for preoperative risk assessment: A pilot usability study , 2019, Surgery.

[4]  Parisa Rashidi,et al.  Improved predictive models for acute kidney injury with IDEA: Intraoperative Data Embedded Analytics , 2018, PloS one.

[5]  Ewout W Steyerberg,et al.  Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers , 2011, Statistics in medicine.

[6]  H. Abdullah,et al.  Utilizing Machine Learning Methods for Preoperative Prediction of Postsurgical Mortality and Intensive Care Unit Admission , 2019, Annals of surgery.

[7]  Raymond A. Jean,et al.  Early postoperative death in extreme-risk patients: A perspective on surgical futility. , 2019, Surgery.

[8]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[9]  Vitaly Herasevich,et al.  Multicenter derivation and validation of an early warning score for acute respiratory failure or death in the hospital , 2018, Critical Care.

[10]  D. Clark,et al.  Procedure-based postoperative risk prediction using NSQIP data. , 2018, The Journal of surgical research.

[11]  Gloria P. Lipori,et al.  MySurgeryRisk: Development and Validation of a Machine-learning Risk Algorithm for Major Complications and Death After Surgery , 2019, Annals of surgery.

[12]  M. Goldblatt,et al.  Eye of the beholder: Risk calculators and barriers to adoption in surgical trainees , 2018, Surgery.

[13]  Aldo A. Faisal,et al.  The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care , 2018, Nature Medicine.

[14]  N. Valizadeh,et al.  ACS-NSQIP risk calculator predicts cohort but not individual risk of complication following colorectal resection. , 2019, American journal of surgery.

[15]  C. Ko,et al.  Development and evaluation of the universal ACS NSQIP surgical risk calculator: a decision aid and informed consent tool for patients and surgeons. , 2013, Journal of the American College of Surgeons.

[16]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[17]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[18]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[19]  H. Pitt,et al.  American College of Surgeons NSQIP Risk Calculator Accuracy for Emergent and Elective Colorectal Operations. , 2017, Journal of the American College of Surgeons.

[20]  Jesse M. Ehrenfeld,et al.  Use of the American College of Surgeons National Surgical Quality Improvement Program Surgical Risk Calculator During Preoperative Risk Discussion: The Patient Perspective , 2019, Anesthesia and analgesia.

[21]  D. Koller,et al.  Integration of Early Physiological Responses Predicts Later Illness Severity in Preterm Infants , 2010, Science Translational Medicine.

[22]  Tezcan Ozrazgat-Baslanti,et al.  The Pattern of Longitudinal Change in Serum Creatinine and 90-Day Mortality After Major Surgery , 2016, Annals of surgery.

[23]  C. Mackenzie,et al.  A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. , 1987, Journal of chronic diseases.