Using Rich Data on Comorbidities in Case-Control Study Design with Electronic Health Record Data Improves Control of Confounding in the Detection of Adverse Drug Reactions

Recent research has suggested that the case-control study design, unlike the self-controlled study design, performs poorly in controlling confounding in the detection of adverse drug reactions (ADRs) from administrative claims and electronic health record (EHR) data, resulting in biased estimates of the causal effects of drugs on health outcomes of interest (HOI) and inaccurate confidence intervals. Here we show that using rich data on comorbidities and automatic variable selection strategies for selecting confounders can better control confounding within a case-control study design and provide a more solid basis for inference regarding the causal effects of drugs on HOIs. Four HOIs are examined: acute kidney injury, acute liver injury, acute myocardial infarction and gastrointestinal ulcer hospitalization. For each of these HOIs we use a previously published reference set of positive and negative control drugs to evaluate the performance of our methods. Our methods have AUCs that are often substantially higher than the AUCs of a baseline method that only uses demographic characteristics for confounding control. Our methods also give confidence intervals for causal effect parameters that cover the expected no effect value substantially more often than this baseline method. The case-control study design, unlike the self-controlled study design, can be used in the fairly typical setting of EHR databases without longitudinal information on patients. With our variable selection method, these databases can be more effectively used for the detection of ADRs.

[1]  Ling Lan,et al.  Statistical Methods for Evaluating Safety in Medical Product Development A. Lawrence Gould John Wiley & Sons, 2015, 400 pages, $99.95, hardcover ISBN: 978-1-119-97966-1 , 2018, International Statistical Review.

[2]  S. Schneeweiss,et al.  Considerations for the analysis of longitudinal electronic health records linked to claims data to study the effectiveness and safety of drugs , 2016, Clinical pharmacology and therapeutics.

[3]  S. Kruger Design Of Observational Studies , 2016 .

[4]  F. Piscaglia,et al.  Hepatorenal syndrome: Update on diagnosis and treatment. , 2015, World journal of nephrology.

[5]  C. Friedman,et al.  A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions , 2015, Drug Safety.

[6]  A. Lawrence Gould Statistical Methods for Evaluating Safety in Medical Product Development: Gould/Statistical Methods for Evaluating Safety in Medical Product Development , 2014 .

[7]  Ying Li,et al.  A method for controlling complex confounding effects in the detection of adverse drug reactions using electronic health records , 2014, J. Am. Medical Informatics Assoc..

[8]  D. Madigan,et al.  A Systematic Statistical Approach to Evaluating Evidence from Observational Studies , 2014 .

[9]  Melissa A. Basford,et al.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data , 2013, Nature Biotechnology.

[10]  M. Schuemie,et al.  Defining a Reference Set to Support Methodological Research in Drug Safety , 2013, Drug Safety.

[11]  David Madigan,et al.  Empirical Performance of a New User Cohort Method: Lessons for Developing a Risk Identification and Analysis System , 2013, Drug Safety.

[12]  Patrick B. Ryan,et al.  Alternative Outcome Definitions and Their Effect on the Performance of Methods for Observational Outcome Studies , 2013, Drug Safety.

[13]  D. Madigan,et al.  A Comparison of the Empirical Performance of Methods for a Risk Identification System , 2013, Drug Safety.

[14]  N. Petejová,et al.  Acute kidney injury following acute pancreatitis: A review. , 2013, Biomedical papers of the Medical Faculty of the University Palacky, Olomouc, Czechoslovakia.

[15]  David Madigan,et al.  Empirical Performance of the Case–Control Method: Lessons for Developing a Risk Identification and Analysis System , 2013, Drug Safety.

[16]  Steven H. Brown,et al.  Automated identification of postoperative complications within an electronic medical record using natural language processing. , 2011, JAMA.

[17]  Stuart J. Nelson,et al.  Normalized names for clinical drugs: RxNorm at 6 years , 2011, J. Am. Medical Informatics Assoc..

[18]  A. Bate,et al.  Quantitative signal detection using spontaneous ADR reporting , 2009, Pharmacoepidemiology and drug safety.

[19]  Joshua D. Angrist,et al.  Mostly Harmless Econometrics: An Empiricist's Companion , 2008 .

[20]  C. Robert Discussion of "Sure independence screening for ultra-high dimensional feature space" by Fan and Lv. , 2008 .

[21]  J. Avorn,et al.  Increasing Levels of Restriction in Pharmacoepidemiologic Database Studies of Elderly and Comparison With Randomized Trial Results , 2007, Medical care.

[22]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[23]  H. Zou,et al.  Addendum: Regularization and variable selection via the elastic net , 2005 .

[24]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[25]  I. Ralph Edwards,et al.  Principles of Signal Detection in Pharmacovigilance , 1997, Drug safety.

[26]  F. Weber Lactulose and combination therapy of hepatic encephalopathy: the role of the intestinal microflora. , 1996, Digestive diseases.

[27]  R. Nesto,et al.  Acute myocardial infarction in the diabetic patient: pathophysiology, clinical course and prognosis. , 1992, Journal of the American College of Cardiology.