Practice of Epidemiology Propensity Score Methods for Analyzing Observational Data Like Randomized Experiments: Challenges and Solutions for Rare Outcomes and Exposures

Randomized controlled trials are the "gold standard" for estimating the causal effects of treatments. However, it is often not feasible to conduct such a trial because of ethical concerns or budgetary constraints. We expand upon an approach to the analysis of observational data sets that mimics a sequence of randomized studies by implementing propensity score models within each trial to achieve covariate balance, using weighting and matching. The methods are illustrated using data from a safety study of the relationship between second-generation antipsychotics and type 2 diabetes (outcome) in Medicaid-insured children aged 10-18 years across the United States from 2003 to 2007. Challenges in this data set include a rare outcome, a rare exposure, substantial and important differences between exposure groups, and a very large sample size.

[1]  Douglas E Schaubel,et al.  The effect of salvage therapy on survival in a longitudinal study with treatment by indication , 2010, Statistics in medicine.

[2]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[3]  Jeffrey A. Smith,et al.  Does Matching Overcome Lalonde's Critique of Nonexperimental Estimators? , 2000 .

[4]  R. D'Agostino Propensity score methods for bias reduction in the comparison of a treatment to a non-randomized control group. , 2005, Statistics in medicine.

[5]  Roger Logan,et al.  Observational data for comparative effectiveness research: An emulation of randomised trials of statins and primary prevention of coronary heart disease , 2013, Statistical methods in medical research.

[6]  James M Robins,et al.  Randomized Trials Analyzed as Observational Studies , 2013, Annals of Internal Medicine.

[7]  Stephen R Cole,et al.  Adjusted survival curves with inverse probability weights , 2004, Comput. Methods Programs Biomed..

[8]  James Carpenter,et al.  Propensity scores: From naïve enthusiasm to intuitive understanding , 2012, Statistical methods in medical research.

[9]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[10]  A. Localio,et al.  Risk for incident diabetes mellitus following initiation of second-generation antipsychotics among Medicaid-enrolled youths. , 2015, JAMA pediatrics.

[11]  D. Thomas Discussion on "Statistical Issues Arising in the Women's Health Initiative" , 2005 .

[12]  Ian Shrier,et al.  Beyond intention to treat: What is the right question? , 2014, Clinical trials.

[13]  Douglas E Schaubel,et al.  A Sequential Stratification Method for Estimating the Effect of a Time‐Dependent Experimental Treatment in Observational Studies , 2006, Biometrics.

[14]  Richard Platt,et al.  Is size the next big thing in epidemiology? , 2013, Epidemiology.

[15]  James M. Robins,et al.  Observational Studies Analyzed Like Randomized Experiments: An Application to Postmenopausal Hormone Therapy and Coronary Heart Disease , 2008, Epidemiology.

[16]  B. Hansen Full Matching in an Observational Study of Coaching for the SAT , 2004 .

[17]  Peter C. Austin,et al.  The Relative Ability of Different Propensity Score Methods to Balance Measured Covariates Between Treated and Untreated Subjects in Observational Studies , 2009, Medical decision making : an international journal of the Society for Medical Decision Making.

[18]  R B D'Agostino,et al.  Relation of pooled logistic regression to time dependent Cox regression analysis: the Framingham Heart Study. , 1990, Statistics in medicine.

[19]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[20]  Y. Matsuyama,et al.  Marginal Structural Models as a Tool for Standardization , 2003, Epidemiology.

[21]  Brian K. Lee,et al.  Weight Trimming and Propensity Score Weighting , 2011, PloS one.

[22]  D. Rubin The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials , 2007, Statistics in medicine.

[23]  P. Austin The use of propensity score methods with survival or time-to-event outcomes: reporting measures of effect similar to those used in randomized experiments , 2013, Statistics in medicine.

[24]  I. White,et al.  Including all individuals is not enough: Lessons for intention-to-treat analysis , 2012, Clinical trials.

[25]  Jared K Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. , 2017, Statistics in medicine.

[26]  Gary King,et al.  MatchIt: Nonparametric Preprocessing for Parametric Causal Inference , 2011 .

[27]  Richard Goldstein,et al.  Regression Methods in Biostatistics: Linear, Logistic, Survival and Repeated Measures Models , 2006, Technometrics.

[28]  M. Olfson,et al.  Antipsychotics and the risk of type 2 diabetes mellitus in children and youth. , 2013, JAMA psychiatry.