Empirical Performance of the Calibrated Self-Controlled Cohort Analysis Within Temporal Pattern Discovery: Lessons for Developing a Risk Identification and Analysis System

BackgroundObservational healthcare data offer the potential to identify adverse drug reactions that may be missed by spontaneous reporting. The self-controlled cohort analysis within the Temporal Pattern Discovery framework compares the observed-to-expected ratio of medical outcomes during post-exposure surveillance periods with those during a set of distinct pre-exposure control periods in the same patients. It utilizes an external control group to account for systematic differences between the different time periods, thus combining within- and between-patient confounder adjustment in a single measure.ObjectivesTo evaluate the performance of the calibrated self-controlled cohort analysis within Temporal Pattern Discovery as a tool for risk identification in observational healthcare data.Research DesignDifferent implementations of the calibrated self-controlled cohort analysis were applied to 399 drug-outcome pairs (165 positive and 234 negative test cases across 4 health outcomes of interest) in 5 real observational databases (four with administrative claims and one with electronic health records).MeasuresPerformance was evaluated on real data through sensitivity/specificity, the area under receiver operator characteristics curve (AUC), and bias.ResultsThe calibrated self-controlled cohort analysis achieved good predictive accuracy across the outcomes and databases under study. The optimal design based on this reference set uses a 360 days surveillance period and a single control period 180 days prior to new prescriptions. It achieved an average AUC of 0.75 and AUC >0.70 in all but one scenario. A design with three separate control periods performed better for the electronic health records database and for acute renal failure across all data sets. The estimates for negative test cases were generally unbiased, but a minor negative bias of up to 0.2 on the RR-scale was observed with the configurations using multiple control periods, for acute liver injury and upper gastrointestinal bleeding.ConclusionsThe calibrated self-controlled cohort analysis within Temporal Pattern Discovery shows promise as a tool for risk identification; it performs well at discriminating positive from negative test cases. The optimal parameter configuration may vary with the data set and medical outcome of interest.

[1]  Johan Hopstadius,et al.  Shrinkage observed-to-expected ratios for robust and transparent large-scale pattern discovery , 2011, Statistical methods in medical research.

[2]  N. Shah,et al.  Performance of Pharmacovigilance Signal‐Detection Algorithms for the FDA Adverse Event Reporting System , 2013, Clinical pharmacology and therapeutics.

[3]  B. Armstrong,et al.  A simple estimator of minimum detectable relative risk, sample size, or power in cohort studies. , 1987, American journal of epidemiology.

[4]  M. Lindquist,et al.  Quality criteria for early signals of possible adverse drug reactions , 1990, The Lancet.

[5]  M. Schuemie Safety surveillance of longitudinal databases: further methodological considerations , 2012, Pharmacoepidemiology and drug safety.

[6]  M D Rawlins,et al.  Spontaneous reporting of adverse drug reactions. , 1986, The Quarterly journal of medicine.

[7]  M. Lindquist,et al.  An ABC of Drug-Related Problems , 2000, Drug safety.

[8]  Janet Woodcock,et al.  Role of postmarketing surveillance in contemporary medicine. , 2011, Annual review of medicine.

[9]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[10]  C P Farrington,et al.  Case series analysis of adverse reactions to vaccines: a comparative evaluation. , 1996, American journal of epidemiology.

[11]  Benjamin M. Smith,et al.  Adverse events associated with treatment of latent tuberculosis in the general population , 2011, Canadian Medical Association Journal.

[12]  G. Niklas Norén,et al.  Temporal pattern discovery for trends and transient effects: its application to patient records , 2008, KDD.

[13]  M. Maclure The case-crossover design: a method for studying transient effects on the risk of acute events. , 1991, American journal of epidemiology.

[14]  David Madigan,et al.  Disproportionality methods for pharmacovigilance in longitudinal observational databases , 2013, Statistical methods in medical research.

[15]  S Suissa,et al.  THE CASE‐TIME-CONTROL DESIGN , 1995, Epidemiology.

[16]  K. Tomecki Acute liver disease associated with erythromycins, sulfonamides, and tetracyclines: Carson JL, Strom BL, Duff A, et al. Ann Intern Med 1993;119:576-83 , 1995 .

[17]  R. Tannen,et al.  Use of primary care electronic medical record database in drug efficacy research on cardiovascular outcomes: comparison of database and randomised controlled trial findings , 2009, BMJ : British Medical Journal.

[18]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[19]  Patrick B. Ryan,et al.  Empirical Performance of a Self-Controlled Cohort Method: Lessons for Developing a Risk Identification and Analysis System , 2013, Drug Safety.

[20]  M. Schuemie,et al.  Defining a Reference Set to Support Methodological Research in Drug Safety , 2013, Drug Safety.

[21]  J. Hallas Evidence of Depression Provoked by Cardiovascular Medication: A Prescription Sequence Symmetry Analysis , 1996, Epidemiology.

[22]  B. Strom,et al.  Acute Liver Disease Associated with Erythromycins, Sulfonamides, and Tetracyclines , 1993, Annals of Internal Medicine.

[23]  D. Madigan,et al.  Empirical assessment of methods for risk identification in healthcare data: results from the experiments of the Observational Medical Outcomes Partnership , 2012, Statistics in medicine.

[24]  M D Rawlins,et al.  Spontaneous reporting of adverse drug reactions. I: the data. , 1988, British journal of clinical pharmacology.

[25]  A. Bate,et al.  Safety surveillance of longitudinal databases: results on real‐world data , 2012 .

[26]  A. Dyer,et al.  Dietary Beta‐Carotene, Vitamin C, and Risk of Prostate Cancer , 1996, Epidemiology.

[27]  G. Niklas Norén,et al.  Temporal pattern discovery in longitudinal electronic patient records , 2010, Data Mining and Knowledge Discovery.

[28]  K. Clauson Drug-Induced Diseases: Prevention, Detection, and Management , 2005 .