Matching by Propensity Score in Cohort Studies with Three Treatment Groups

Background: Nonrandomized pharmacoepidemiology generally compares one medication with another. For many conditions, clinicians can benefit from comparing the safety and effectiveness of three or more appropriate treatment options. We sought to compare three treatment groups simultaneously by creating 1:1:1 propensity score-matched cohorts. Methods: We developed a technique that estimates generalized propensity scores and then creates 1:1:1 matched sets. We compared this methodology with two existing approaches—construction of matched cohorts through a common-referent group and a pairwise match for each possible contrast. In a simulation, we varied unmeasured confounding, presence of treatment effect heterogeneity, and the prevalence of treatments and compared each method’s bias, variance, and mean squared error (MSE) of the treatment effect. We applied these techniques to a cohort of rheumatoid arthritis patients treated with nonselective nonsteroidal anti-inflammatory drugs, COX-2 selective inhibitors, or opioids. Results: We performed 1000 simulation runs. In the base case, we observed an average bias of 0.4% (MSE × 100 = 0.2) in the three-way matching approach and an average bias of 0.3% (MSE × 100 = 0.2) with the pairwise technique. The techniques showed differing bias and MSE with increasing treatment effect heterogeneity and decreasing propensity score overlap. With highly unequal exposure prevalences, strong heterogeneity, and low overlap, we observed a bias of 6.5% (MSE × 100 = 10.8) in the three-way approach and 12.5% (MSE × 100 = 12.3) in the pairwise approach. The empirical study displayed better covariate balance using the pairwise approach. Point estimates were substantially similar. Conclusions: Our matching approach offers an effective way to study the safety and effectiveness of three treatment options. We recommend its use over the pairwise or common-referent approaches.

[1]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[2]  Donald Rubin,et al.  Estimating Causal Effects from Large Data Sets Using Propensity Scores , 1997, Annals of Internal Medicine.

[3]  G. Imbens The Role of the Propensity Score in Estimating Dose-Response Functions , 1999 .

[4]  P. Rosenbaum,et al.  Substantial Gains in Bias Reduction from Matching with a Variable Number of Controls , 2000, Biometrics.

[5]  Paul R Rosenbaum,et al.  Optimal Pair Matching With Two Control Groups , 2004 .

[6]  M Alan Brookhart,et al.  Analytic strategies to adjust confounding using exposure propensity scores and disease risk scores: nonsteroidal antiinflammatory drugs and short-term mortality in the elderly. , 2005, American journal of epidemiology.

[7]  J. Avorn,et al.  A review of uses of health care utilization databases for epidemiologic research on therapeutics. , 2005, Journal of clinical epidemiology.

[8]  Til Stürmer,et al.  A review of the application of propensity score methods yielded increasing use, advantages in specific settings, but not substantially different estimates compared with conventional multivariable methods. , 2006, Journal of clinical epidemiology.

[9]  Peter C Austin,et al.  A critical appraisal of propensity‐score matching in the medical literature between 1996 and 2003 , 2008, Statistics in medicine.

[10]  Peter C Austin,et al.  Some Methods of Propensity‐Score Matching had Superior Performance to Others: Results of an Empirical Investigation and Monte Carlo simulations , 2009, Biometrical journal. Biometrische Zeitschrift.

[11]  J. Rassen,et al.  Confounding Control in Healthcare Database Research: Challenges and Potential Approaches , 2010, Medical care.

[12]  J. Rassen,et al.  The comparative safety of opioids for nonmalignant pain in older adults. , 2011, Archives of internal medicine.

[13]  J. Rassen,et al.  The comparative safety of analgesics in older adults with arthritis. , 2010, Archives of internal medicine.

[14]  Til Stürmer,et al.  Confounder summary scores when comparing the effects of multiple drug exposures , 2010, Pharmacoepidemiology and drug safety.

[15]  J. Avorn,et al.  Treatment effects in the presence of unmeasured confounding: dealing with observations in the tails of the propensity score distribution--a simulation study. , 2010, American journal of epidemiology.

[16]  P. Austin,et al.  Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies , 2010, Pharmaceutical statistics.

[17]  Anthonius Boer,et al.  Measuring balance and model selection in propensity score methods , 2011, Pharmacoepidemiology and drug safety.

[18]  Abhi Shelat,et al.  One‐to‐many propensity score matching in cohort studies , 2012, Pharmacoepidemiology and drug safety.

[19]  Xiao-Hua Zhou,et al.  Generalized propensity score for estimating the average treatment effect of multiple treatments , 2012, Statistics in medicine.