Health Informatics via Machine Learning for the Clinical Management of Patients

OBJECTIVES To review how health informatics systems based on machine learning methods have affected clinical practice, and through it the clinical management of patients.

METHODS We reviewed literature published from 2010 to 2015 in databases such as PubMed, IEEE Xplore, and INSPEC, in which methods based on machine learning are likely to be reported. We bring together a broad body of literature, aiming to identify the leading examples of health informatics that have advanced the methodology of machine learning. Although further examples could be added for individual methods, we have chosen some of the most representative and informative exemplars in each case.

RESULTS Our survey highlights that, while much research is taking place in this high-profile field, examples of systems that affect the clinical management of patients are seldom found. We show that substantial progress is being made in terms of methodology, often by data scientists working in close collaboration with clinical groups.

CONCLUSIONS Health informatics systems based on machine learning are in their infancy, and the translation of such systems into clinical management has yet to be performed at scale.
