Using high‐dimensional propensity scores to automate confounding control in a distributed medical product safety surveillance system

Distributed medical product safety monitoring systems such as the Sentinel System, to be developed as a part of Food and Drug Administration's Sentinel Initiative, will require automation of large parts of the safety evaluation process to achieve the necessary speed and scale at reasonable cost without sacrificing validity. Although certain functions will require investigator intervention, confounding control is one area that can largely be automated. The high‐dimensional propensity score (hd‐PS) algorithm is one option for automated confounding control in longitudinal healthcare databases. In this article, we discuss the use of hd‐PS for automating confounding control in sequential database cohort studies, as applied to safety monitoring systems. In particular, we discuss the robustness of the covariate selection process, the potential for over‐ or under‐selection of variables including the possibilities of M‐bias and Z‐bias, the computation requirements, the practical considerations in a federated database network, and the cases where automated confounding adjustment may not function optimally. We also outline recent improvements to the algorithm and show how the algorithm has performed in several published studies. We conclude that despite certain limitations, hd‐PS offers substantial advantages over non‐automated alternatives in active product safety monitoring systems. Copyright © 2012 John Wiley & Sons, Ltd.

[1]  M Alan Brookhart,et al.  Instrumental variables II: instrumental variable application-in 25 variations, the physician prescribing preference generally was strong and reduced covariate imbalance. , 2009, Journal of clinical epidemiology.

[2]  Sebastian Schneeweiss,et al.  Multivariate‐adjusted pharmacoepidemiologic analyses of confidential information pooled from multiple health care utilization databases , 2010, Pharmacoepidemiology and drug safety.

[3]  J. Rassen,et al.  Confounding Control in Healthcare Database Research: Challenges and Potential Approaches , 2010, Medical care.

[4]  Paul Zador,et al.  Variable selection and raking in propensity scoring. , 2007, Statistics in medicine.

[5]  M. Maclure The case-crossover design: a method for studying transient effects on the risk of acute events. , 1991, American journal of epidemiology.

[6]  M Alan Brookhart,et al.  Instrumental variables I: instrumental variables exploit natural variation in nonexperimental data to estimate causal relationships. , 2009, Journal of clinical epidemiology.

[7]  R. Makuch,et al.  Gastrointestinal toxicity with celecoxib vs nonsteroidal anti-inflammatory drugs for osteoarthritis and rheumatoid arthritis: the CLASS study: A randomized controlled trial. Celecoxib Long-term Arthritis Safety Study. , 2000, JAMA.

[8]  J. Myers,et al.  Effects of adjusting for instrumental variables on bias and precision of effect estimates. , 2011, American journal of epidemiology.

[9]  G. Shaw,et al.  Maternal pesticide exposure from multiple sources and selected congenital anomalies. , 1999 .

[10]  Sengwee Toh,et al.  Confounding adjustment via a semi‐automated high‐dimensional propensity score algorithm: an application to electronic medical records , 2011, Pharmacoepidemiology and drug safety.

[11]  Lise,et al.  Comparison of upper gastrointestinal toxicity of rofecoxib and naproxen in patients with rheumatoid arthritis. VIGOR Study Group. , 2000, The New England journal of medicine.

[12]  I. Bross Spurious effects from an extraneous variable. , 1966, Journal of chronic diseases.

[13]  Donald Rubin,et al.  Estimating Causal Effects from Large Data Sets Using Propensity Scores , 1997, Annals of Internal Medicine.

[14]  Jeremy A Rassen,et al.  Privacy-Maintaining Propensity Score-Based Pooling of Multiple Databases Applied to a Study of Biologics , 2010, Medical care.

[15]  Sebastian Schneeweiss,et al.  Instrumental variable methods in comparative safety and effectiveness research , 2010, Pharmacoepidemiology and drug safety.

[16]  D. Rubin The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials , 2007, Statistics in medicine.

[17]  C R Weinberg,et al.  Toward a clearer definition of confounding. , 1993, American journal of epidemiology.

[18]  J. Avorn,et al.  High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data , 2009, Epidemiology.

[19]  J. Pearl,et al.  Causal diagrams for epidemiologic research. , 1999, Epidemiology.

[20]  W. Ray,et al.  Evaluating medication effects outside of clinical trials: new-user designs. , 2003, American journal of epidemiology.

[21]  S. Schneeweiss,et al.  Variation in the risk of suicide attempts and completed suicides by antidepressant agent in adults: a propensity score-adjusted analysis of 9 years' data. , 2010, Archives of general psychiatry.

[22]  J. Rassen,et al.  Applying propensity scores estimated in a full cohort to adjust for confounding in subgroup analyses , 2012, Pharmacoepidemiology and drug safety.

[23]  J. Pearl Invited commentary: understanding bias amplification. , 2011, American journal of epidemiology.

[24]  J. Avorn,et al.  Anticonvulsant medications and the risk of suicide, attempted suicide, or violent death. , 2010, Journal of the American Medical Association (JAMA).

[25]  S. Greenland Quantifying Biases in Causal Models: Classical Confounding vs Collider-Stratification Bias , 2003, Epidemiology.

[26]  M Maclure,et al.  Performance of comorbidity scores to control for confounding in epidemiologic studies using claims data. , 2001, American journal of epidemiology.

[27]  Sebastian Schneeweiss,et al.  Comparative Safety of Antidepressant Agents for Children and Adolescents Regarding Suicidal Acts , 2010, Pediatrics.

[28]  R J Glynn,et al.  Assessing the Comparative Effectiveness of Newly Marketed Medications: Methodological Challenges and Implications for Drug Development , 2011, Clinical pharmacology and therapeutics.

[29]  Judea Pearl,et al.  On a Class of Bias-Amplifying Variables that Endanger Effect Estimates , 2010, UAI.

[30]  J. Avorn,et al.  Treatment effects in the presence of unmeasured confounding: dealing with observations in the tails of the propensity score distribution--a simulation study. , 2010, American journal of epidemiology.

[31]  M Alan Brookhart,et al.  Covariate selection in high-dimensional propensity score analyses of treatment effects in small samples. , 2011, American journal of epidemiology.

[32]  Sebastian Schneeweiss,et al.  Cardiovascular Outcomes and Mortality in Patients Using Clopidogrel With Proton Pump Inhibitors After Percutaneous Coronary Intervention or Acute Coronary Syndrome , 2009, Circulation.

[33]  K. Huybrechts,et al.  Risk of death and hospital admission for major medical events after initiation of psychotropic medications in older adults admitted to nursing homes , 2011, Canadian Medical Association Journal.

[34]  Sebastian Schneeweiss,et al.  A basic study design for expedited safety signal evaluation based on electronic healthcare data , 2010, Pharmacoepidemiology and drug safety.

[35]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[36]  R Day,et al.  Comparison of upper gastrointestinal toxicity of rofecoxib and naproxen in patients with rheumatoid arthritis. VIGOR Study Group. , 2000, The New England journal of medicine.

[37]  P. Mahalanobis On the generalized distance in statistics , 1936 .