Refining Adverse Drug Reaction Signals by Incorporating Interaction Variables Identified Using Emergent Pattern Mining

PURPOSE To develop a framework for identifying and incorporating candidate confounding interaction terms into a regularised cox regression analysis to refine adverse drug reaction signals obtained via longitudinal observational data. METHODS We considered six drug families that are commonly associated with myocardial infarction in observational healthcare data, but where the causal relationship ground truth is known (adverse drug reaction or not). We applied emergent pattern mining to find itemsets of drugs and medical events that are associated with the development of myocardial infarction. These are the candidate confounding interaction terms. We then implemented a cohort study design using regularised cox regression that incorporated and accounted for the candidate confounding interaction terms. RESULTS The methodology was able to account for signals generated due to confounding and a cox regression with elastic net regularisation correctly ranking the drug families known to be true adverse drug reactions above those that are not. This was not the case without the inclusion of the candidate confounding interaction terms, where confounding leads to a non-adverse drug reaction being ranked highest. CONCLUSIONS The methodology is efficient, can identify high-order confounding interactions and does not require expert input to specify outcome specific confounders, so it can be applied for any outcome of interest to quickly refine its signals. The proposed method shows excellent potential to overcome some forms of confounding and therefore reduce the false positive rate for signal analysis using longitudinal data.

[1]  Ying Li,et al.  A method for controlling complex confounding effects in the detection of adverse drug reactions using electronic health records , 2014, J. Am. Medical Informatics Assoc..

[2]  Uwe Aickelin,et al.  A Novel Semisupervised Algorithm for Rare Prescription Side Effect Discovery , 2014, IEEE Journal of Biomedical and Health Informatics.

[3]  Kaare Christensen,et al.  Causal Inference and Observational Research , 2010, Perspectives on psychological science : a journal of the Association for Psychological Science.

[4]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[5]  Uwe Aickelin,et al.  A Novel Semisupervised Algorithm for Rare Prescription Side Effect Discovery , 2014, IEEE J. Biomed. Health Informatics.

[6]  M. Grace,et al.  Hazard Ratio in Clinical Trials , 2004, Antimicrobial Agents and Chemotherapy.

[7]  John M Brooks,et al.  Squeezing the balloon: propensity scores and unmeasured covariate balance. , 2013, Health services research.

[8]  Uwe Aickelin,et al.  Comparison of algorithms that detect drug side effects using electronic healthcare databases , 2013, Soft Comput..

[9]  Carol Friedman,et al.  Mining electronic health records for adverse drug effects using regression based methods , 2010, IHI.

[10]  Fei Wang,et al.  From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records , 2014, KDD.

[11]  R. Tibshirani,et al.  A LASSO FOR HIERARCHICAL INTERACTIONS. , 2012, Annals of statistics.

[12]  Pedro J. Caraballo,et al.  Survival Association Rule Mining Towards Type 2 Diabetes Risk Assessment , 2013, AMIA.

[13]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[14]  David Madigan,et al.  Large‐scale regression‐based pattern discovery: The example of screening the WHO global drug safety database , 2010, Stat. Anal. Data Min..

[15]  W. Bilker,et al.  Validation studies of the health improvement network (THIN) database for pharmacoepidemiology research , 2007, Pharmacoepidemiology and drug safety.

[16]  G. Shepherd,et al.  Adverse Drug Reaction Deaths Reported in United States Vital Statistics, 1999-2006 , 2012, The Annals of pharmacotherapy.

[17]  M. Pirmohamed,et al.  Adverse drug reactions as cause of admission to hospital: prospective analysis of 18 820 patients , 2004, BMJ : British Medical Journal.

[18]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[19]  Ian Shrier,et al.  Re: "Variable selection for propensity score models". , 2007, American journal of epidemiology.

[20]  Cornelius Friesendorf,et al.  Squeezing the balloon? , 2006 .

[21]  Azeem Majeed,et al.  Ten-year trends in hospital admissions for adverse drug reactions in England 1999–2009 , 2010, Journal of the Royal Society of Medicine.

[22]  Uwe Aickelin,et al.  Signalling Paediatric Side Effects using an Ensemble of Simple Study Designs , 2014, Drug Safety.

[23]  Martijn J. Schuemie,et al.  Replication of the OMOP Experiment in Europe: Evaluating Methods for Risk Identification in Electronic Health Record Databases , 2013, Drug Safety.

[24]  M. Lindquist,et al.  A comparison of measures of disproportionality for signal detection in spontaneous reporting systems for adverse drug reactions , 2002, Pharmacoepidemiology and drug safety.

[25]  L. Härmark,et al.  Pharmacovigilance: methods, recent developments and future perspectives , 2008, European Journal of Clinical Pharmacology.

[26]  J. Avorn,et al.  Variable selection for propensity score models. , 2006, American journal of epidemiology.

[27]  S. Goldman,et al.  Limitations and strengths of spontaneous reports data. , 1998, Clinical therapeutics.

[28]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[29]  Kurt Hornik,et al.  Introduction to arules – A computational environment for mining association rules and frequent item sets , 2009 .

[30]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[31]  M. Pirmohamed,et al.  Adverse Drug Reactions in Hospital In-Patients: A Prospective Analysis of 3695 Patient-Episodes , 2009, PloS one.

[32]  M Pirmohamed,et al.  Adverse drug reactions in hospital in‐patients: a pilot study , 2006, Journal of clinical pharmacy and therapeutics.

[33]  Fan Yu,et al.  Towards large-scale twitter mining for drug-related adverse events , 2012, SHB '12.

[34]  Guoqing Diao,et al.  Estimation of time‐dependent area under the ROC curve for long‐term risk prediction , 2006, Statistics in medicine.

[35]  M. Milik,et al.  Mapping adverse drug reactions in chemical space. , 2009, Journal of medicinal chemistry.

[36]  D. Madigan,et al.  Empirical assessment of methods for risk identification in healthcare data: results from the experiments of the Observational Medical Outcomes Partnership , 2012, Statistics in medicine.

[37]  A. McMahon,et al.  Approaches to combat with confounding by indication in observational studies of intended drug effects , 2003, Pharmacoepidemiology and drug safety.

[38]  J. Avorn,et al.  High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data , 2009, Epidemiology.

[39]  Munir Pirmohamed,et al.  Adverse drug reactions as cause of admission to hospital , 2004 .

[40]  David Madigan,et al.  Large-scale regression-based pattern discovery: The example of screening the WHO global drug safety database , 2010 .

[41]  P. Grambsch,et al.  Modeling Survival Data: Extending the Cox Model , 2000 .