Using Data-Driven Rules to Predict Mortality in Severe Community Acquired Pneumonia

Prediction of patient-centered outcomes in hospitals is useful for performance benchmarking, resource allocation, and guidance regarding active treatment and withdrawal of care. Yet, their use by clinicians is limited by the complexity of available tools and amount of data required. We propose to use Disjunctive Normal Forms as a novel approach to predict hospital and 90-day mortality from instance-based patient data, comprising demographic, genetic, and physiologic information in a large cohort of patients admitted with severe community acquired pneumonia. We develop two algorithms to efficiently learn Disjunctive Normal Forms, which yield easy-to-interpret rules that explicitly map data to the outcome of interest. Disjunctive Normal Forms achieve higher prediction performance quality compared to a set of state-of-the-art machine learning models, and unveils insights unavailable with standard methods. Disjunctive Normal Forms constitute an intuitive set of prediction rules that could be easily implemented to predict outcomes and guide criteria-based clinical decision making and clinical trial execution, and thus of greater practical usefulness than currently available prediction tools. The Java implementation of the tool JavaDNF will be publicly available.

[1]  R. Samudrala,et al.  Simple Linear Model Provides Highly Accurate Genotypic Predictions of HIV-1 Drug Resistance , 2003, Antiviral therapy.

[2]  K. Reinhart,et al.  Comparison of the performance of SAPS II, SAPS 3, APACHE II, and their customized prognostic models in a surgical intensive care unit. , 2008, British journal of anaesthesia.

[3]  J. Vincent,et al.  Classification, incidence, and outcomes of sepsis and multiple organ failure. , 2007, Contributions to nephrology.

[4]  M. Leinonen,et al.  Aetiology, outcome and prognostic factors in community-acquired pneumonia requiring hospitalization. , 1990, The European respiratory journal.

[5]  M. Feldmann,et al.  The rationale for the current boom in anti-TNFα treatment. Is there an effective means to define therapeutic targets for drugs that provide all the benefits of anti-TNFα and minimise hazards? , 1999, Annals of the rheumatic diseases.

[6]  Steven B. Johnson,et al.  Efficacy and safety of the monoclonal anti-tumor necrosis factor antibody F(ab′)2 fragment afelimomab in patients with severe sepsis and elevated interleukin-6 levels* , 2004, Critical care medicine.

[7]  Evangelos Triantaphyllou,et al.  On the minimum number of logical clauses inferred from examples , 1996, Comput. Oper. Res..

[8]  M. Fine,et al.  A prediction rule to identify low-risk patients with community-acquired pneumonia. , 1997, The New England journal of medicine.

[9]  M. Fine,et al.  Prognosis of patients hospitalized with community-acquired pneumonia. , 1990, The American journal of medicine.

[10]  G. Clermont,et al.  In silico design of clinical trials: A method coming of age , 2004, Critical care medicine.

[11]  R. Shafer,et al.  Genotypic predictors of human immunodeficiency virus type 1 drug resistance , 2006, Proceedings of the National Academy of Sciences.

[12]  R. Read Experimental therapies for sepsis directed against tumour necrosis factor. , 1998, The Journal of antimicrobial chemotherapy.

[13]  Bekele Afessa,et al.  Comparison of APACHE III, APACHE IV, SAPS 3, and MPM0III and influence of resuscitation status on model performance. , 2012, Chest.

[14]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[15]  M. Trabucchi,et al.  Is pneumonia still the old man's friend? , 2003, Archives of internal medicine.

[16]  Jianhua Chen,et al.  An incremental learning algorithm for constructing Boolean functions from positive and negative examples , 2002, Comput. Oper. Res..

[17]  D C Angus,et al.  Severity scoring systems in the modern intensive care unit. , 1998, Annals of the Academy of Medicine, Singapore.

[18]  Brendan Larder,et al.  Non‐parametric methods to predict HIV drug susceptibility phenotype from genotype , 2003, Statistics in medicine.

[19]  John A Kellum,et al.  Understanding the inflammatory cytokine response in pneumonia and sepsis: results of the Genetic and Inflammatory Markers of Sepsis (GenIMS) Study. , 2007, Archives of internal medicine.

[20]  Sorin Draghici,et al.  Predicting HIV drug resistance with neural networks , 2003, Bioinform..

[21]  J. Zimmerman,et al.  Acute Physiology and Chronic Health Evaluation (APACHE) IV: Hospital mortality assessment for today’s critically ill patients* , 2006, Critical care medicine.

[22]  E. Ibrahim,et al.  Community acquired acute bacterial and atypical pneumonia in Saudi Arabia. , 1992, Thorax.

[23]  Margaret M Parker,et al.  Surviving Sepsis Campaign guidelines for management of severe sepsis and septic shock , 2004, Critical care medicine.

[24]  C Kooperberg,et al.  Sequence Analysis Using Logic Regression , 2001, Genetic epidemiology.

[25]  James F. Gimpel A Method of Producing a Boolean Function Having an Arbitrarily Prescribed Prime Implicant Table , 1965, IEEE Trans. Electron. Comput..

[26]  D Draper,et al.  Changes in sickness at admission following the introduction of the prospective payment system. , 1990, JAMA.

[27]  G. Clermont,et al.  Predicting hospital mortality for patients in the intensive care unit: A comparison of artificial neural networks with logistic regression models , 2001, Critical care medicine.

[28]  S. Calvano,et al.  Human toll-like receptor 4 mutations but not CD14 polymorphisms are associated with an increased risk of gram-negative infections. , 2002, The Journal of infectious diseases.

[29]  M. Fine,et al.  Hospitalization decision in patients with community-acquired pneumonia: a prospective cohort study. , 1990, The American journal of medicine.

[30]  Sue E. Poynter,et al.  Role of gene polymorphisms in sepsis. , 2002, Pediatric critical care medicine : a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies.

[31]  J. Schapiro,et al.  Methods for investigation of the relationship between drug-susceptibility phenotype and human immunodeficiency virus type 1 genotype with applications to AIDS clinical trials group 333. , 2000, The Journal of infectious diseases.

[32]  M. Levy,et al.  2001 SCCM/ESICM/ACCP/ATS/SIS International Sepsis Definitions Conference , 2003, Intensive care medicine.

[33]  J Ean,et al.  Efficacy and safety of recombinant human activated protein C for severe sepsis. , 2001, The New England journal of medicine.

[34]  G. Clermont,et al.  Comparison of Cox and Gray’s survival models in severe sepsis* , 2004, Critical care medicine.

[35]  Thomas Lengauer,et al.  Diversity and complexity of HIV-1 drug resistance: A bioinformatics approach to predicting phenotype from genotype , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Ingo Ruczinski,et al.  Identifying interacting SNPs using Monte Carlo logic regression , 2005, Genetic epidemiology.

[37]  Eduard E Vasilevskis,et al.  Mortality probability model III and simplified acute physiology score II: assessing their value in predicting length of stay and comparison to APACHE IV. , 2009, Chest.

[38]  T. Marrie,et al.  Community-acquired pneumonia requiring hospitalization: 5-year prospective study. , 1989, Reviews of infectious diseases.

[39]  R. Cantor,et al.  Tumor necrosis factor gene polymorphisms and the variable presentation and outcome of community-acquired pneumonia. , 2002, Chest.

[40]  B. Larder,et al.  Enhanced prediction of lopinavir resistance from genotype by use of artificial neural networks. , 2003, The Journal of infectious diseases.

[41]  F. Stüber Effects of genomic polymorphisms on the course of sepsis: is there a concept for gene therapy? , 2001, Journal of the American Society of Nephrology : JASN.

[42]  Predicting hospital-associated mortality for Medicare patients. A method for patients with stroke, pneumonia, acute myocardial infarction, and congestive heart failure. , 1988 .

[43]  B. Grandbastien,et al.  Can generic scores (Pediatric Risk of Mortality and Pediatric Index of Mortality) replace specific scores in predicting the outcome of presumed meningococcal septic shock in children? , 2001, Critical care medicine.

[44]  Mauricio G. C. Resende,et al.  A continuous approach to inductive inference , 1992, Math. Program..

[45]  A. Muriel,et al.  Performance of the third-generation models of severity scoring systems (APACHE IV, SAPS 3 and MPM-III) in acute kidney injury critically ill patients. , 2011, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[46]  Luc De Raedt,et al.  Phase Transitions and Stochastic Local Search in k-Term DNF Learning , 2002, ECML.

[47]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[48]  Gary Garber,et al.  The efficacy and safety of recombinant human activated protein C for the treatment of patients with severe sepsis (vol 28, pg 48, 2000) , 2001 .

[49]  Mitchell M. Levy,et al.  2001 SCCM/ESICM/ACCP/ATS/SIS International Sepsis Definitions Conference , 2003, Intensive Care Medicine.

[50]  George E. Hale The Law of Sun-Spot Polarity. , 1924 .

[51]  D Draper,et al.  Predicting hospital-associated mortality for Medicare patients. A method for patients with stroke, pneumonia, acute myocardial infarction, and congestive heart failure. , 1988, JAMA.

[52]  Thomas Lengauer,et al.  Geno2pheno: Interpreting Genotypic HIV Drug Resistance Tests , 2001, IEEE Intell. Syst..

[53]  J. Helterbrand,et al.  Efficacy and safety of recombinant human activated protein C for severe sepsis , 2003 .

[54]  D. Hosmer,et al.  A review of goodness of fit statistics for use in the development of logistic regression models. , 1982, American journal of epidemiology.

[55]  G. Clermont,et al.  Epidemiology of severe sepsis in the United States: Analysis of incidence, outcome, and associated costs of care , 2001, Critical care medicine.

[56]  G. Bernard,et al.  The effect of drotrecogin alfa (activated) on long-term survival after severe sepsis * , 2004, Critical care medicine.

[57]  M. Fine,et al.  Comparison of a disease-specific and a generic severity of illness measure for patients with community-acquired pneumonia , 1995, Journal of General Internal Medicine.