Do no harm: a roadmap for responsible machine learning for health care

Interest in machine-learning applications within medicine has been growing, but few studies have progressed to deployment in patient care. We present a framework, context and ultimately guidelines for accelerating the translation of machine-learning-based interventions in health care. To be successful, translation will require a team of engaged stakeholders and a systematic process from beginning (problem formulation) to end (widespread deployment).In this Perspective, the authors present a framework, context and guidelines for accelerating the translation of machine-learning-based interventions in health care.

[1]  John F. Hurdle,et al.  Measuring diagnoses: ICD code accuracy. , 2005, Health services research.

[2]  David R Williams,et al.  Race, socioeconomic status, and health: Complexities, ongoing challenges, and research opportunities , 2010, Annals of the New York Academy of Sciences.

[3]  G. Moody,et al.  Predicting in-hospital mortality of ICU patients: The PhysioNet/Computing in cardiology challenge 2012 , 2012, 2012 Computing in Cardiology.

[4]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[5]  P. Pronovost,et al.  A targeted real-time early warning score (TREWScore) for septic shock , 2015, Science Translational Medicine.

[6]  R J Lilford,et al.  The stepped wedge cluster randomised trial: rationale, design, analysis, and reporting , 2015, BMJ : British Medical Journal.

[7]  Cathy O'Neil,et al.  Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy , 2016, Vikalpa: The Journal for Decision Makers.

[8]  Suchi Saria,et al.  Reliable Decision Support using Counterfactual Models , 2017, NIPS.

[9]  Steve Chien,et al.  Robotic space exploration agents , 2017, Science Robotics.

[10]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[11]  Tony Doyle,et al.  Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy , 2017, Inf. Soc..

[12]  Matthew Hutson Even artificial intelligence can acquire biases against race and gender , 2017 .

[13]  Ying Zhang,et al.  Multivariate Time Series Imputation with Generative Adversarial Networks , 2018, NeurIPS.

[14]  Philip Bachman,et al.  Deep Reinforcement Learning that Matters , 2017, AAAI.

[15]  Marzyeh Ghassemi,et al.  Rethinking clinical prediction: Why machine learning must consider year of care and feature aggregation , 2018, ArXiv.

[16]  Barbara Evans,et al.  The Challenge of Regulating Clinical Decision Support Software After 21st Century Cures , 2018, American Journal of Law & Medicine.

[17]  Harris Mateen Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy , 2018 .

[18]  Preface: The 21st Century Cures Act—A Cure for the 21st Century? , 2018, American Journal of Law & Medicine.

[19]  Jenna Wiens,et al.  A Generalizable, Data-Driven Approach to Predict Daily Risk of Clostridium difficile Infection at Two Large Academic Health Centers , 2018, Infection Control & Hospital Epidemiology.

[20]  M. Ghassemi,et al.  Can AI Help Reduce Disparities in General Medical and Mental Health Care? , 2019, AMA journal of ethics.

[21]  Suchi Saria,et al.  Tutorial: Safe and Reliable Machine Learning , 2019, ArXiv.

[22]  Nigam H Shah,et al.  The number needed to benefit: estimating the value of predictive analytics in healthcare , 2019, J. Am. Medical Informatics Assoc..

[23]  Suchi Saria,et al.  Can You Trust This Prediction? Auditing Pointwise Reliability After Learning , 2019, AISTATS.

[24]  Jie Xu,et al.  The practical implementation of artificial intelligence technologies in medicine , 2019, Nature Medicine.