Countering imbalanced datasets to improve adverse drug event predictive models in labor and delivery

BACKGROUND The Institute of Medicine (IOM) report, Preventing Medication Errors, emphasizes how little is known about the incidence of adverse drug events (ADEs). Operating rooms, emergency departments and intensive care units are known to have a higher incidence of ADEs. Labor and delivery (L&D) is an emergency care unit that could carry an increased risk of ADEs, yet reported rates remain low and under-reporting is suspected. Identifying risk factors with electronic pattern recognition techniques could improve ADE detection rates.

OBJECTIVE The objective of the present study is to apply the Synthetic Minority Over-sampling Technique (SMOTE) as an enhanced sampling method on a sparse dataset, in order to generate prediction models that identify ADEs in women admitted for labor and delivery based on patient risk factors and comorbidities.

RESULTS By creating synthetic cases with the SMOTE algorithm and using 10-fold cross-validation, we demonstrated improved performance of the Naïve Bayes and decision tree algorithms. The true positive rate (TPR) rose from 0.32 on the raw dataset to 0.67 on the 800% over-sampled dataset.

CONCLUSION Synthetic minority class over-sampling can enhance the performance of classification algorithms on sparse clinical datasets. Predictive models created in this manner can be used to develop evidence-based ADE monitoring systems.
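As an illustration of the over-sampling step described above, the following is a minimal sketch of SMOTE-style interpolation and of the true positive rate, written in plain Python. It is not the study's implementation: the function names `smote` and `true_positive_rate`, the parameter defaults (including `k=5` neighbours) and the toy feature vectors are all assumptions made for this example, and the sketch handles numeric features only.

```python
import math
import random


def smote(minority, n_percent=800, k=5, seed=0):
    """Return synthetic minority-class samples (illustrative sketch).

    minority  : list of numeric feature vectors for the rare class
    n_percent : amount of over-sampling; 800 means 8 synthetic cases
                are generated per real minority case
    k         : number of nearest minority neighbours considered
                (an assumed default, not taken from the study)
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    per_sample = n_percent // 100
    k = min(k, len(minority) - 1)
    synthetic = []
    for i, x in enumerate(minority):
        # Rank the other minority cases by Euclidean distance to x.
        others = [m for j, m in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda m: math.dist(x, m))[:k]
        for _ in range(per_sample):
            nb = rng.choice(neighbours)
            gap = rng.random()  # random position on the segment x -> nb
            synthetic.append([a + gap * (b - a) for a, b in zip(x, nb)])
    return synthetic


def true_positive_rate(y_true, y_pred):
    """TPR = TP / (TP + FN), with 1 marking the positive (ADE) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn)


# Hypothetical toy data: three minority cases with two numeric features.
toy_minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
toy_synthetic = smote(toy_minority, n_percent=800, k=2)
```

Each synthetic case lies on the line segment between a real minority case and one of its nearest minority neighbours, so 800% over-sampling adds eight such cases per real case. When a step like this is combined with cross-validation, the synthetic cases are normally generated from the training folds only, so that the held-out fold contains real cases alone.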
