Estimating long‐term treatment effects in observational data: A comparison of the performance of different methods under real‐world uncertainty

In the presence of time‐dependent confounding, there are several methods available to estimate treatment effects. With correctly specified models and appropriate structural assumptions, any of these methods could provide consistent effect estimates, but with real‐world data, all models will be misspecified and it is difficult to know if assumptions are violated. In this paper, we investigate five methods: inverse probability weighting of marginal structural models, history‐adjusted marginal structural models, sequential conditional mean models, g‐computation formula, and g‐estimation of structural nested models. This work is motivated by an investigation of the effects of treatments in cystic fibrosis using the UK Cystic Fibrosis Registry data focussing on two outcomes: lung function (continuous outcome) and annual number of days receiving intravenous antibiotics (count outcome). We identified five features of this data that may affect the performance of the methods: misspecification of the causal null, long‐term treatment effects, effect modification by time‐varying covariates, misspecification of the direction of causal pathways, and censoring. In simulation studies, under ideal settings, all five methods provide consistent estimates of the treatment effect with little difference between methods. However, all methods performed poorly under some settings, highlighting the importance of using appropriate methods based on the data available. Furthermore, with the count outcome, the issue of non‐collapsibility makes comparison between methods delivering marginal and conditional effects difficult. In many situations, we would recommend using more than one of the available methods for analysis, as if the effect estimates are very different, this would indicate potential issues with the analyses.

[1]  James M Robins,et al.  The International Journal of Biostatistics CAUSAL INFERENCE Dynamic Regime Marginal Structural Mean Models for Estimation of Optimal Dynamic Treatment Regimes , Part II : Proofs of Results , 2011 .

[2]  Tyler J. VanderWeele,et al.  Concerning the consistency assumption in causal inference. , 2009, Epidemiology.

[3]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[4]  Andrew Forbes,et al.  Variance reduction in randomised trials by inverse probability weighting using the propensity score , 2013, Statistics in medicine.

[5]  Stijn Vansteelandt,et al.  A Note on G-Estimation of Causal Risk Ratios. , 2018, American journal of epidemiology.

[6]  T. Schuster,et al.  Effect Estimation in Point-Exposure Studies with Binary Outcomes and High-Dimensional Covariate Data – A Comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weighting , 2016, The international journal of biostatistics.

[7]  B L De Stavola,et al.  Methods for dealing with time‐dependent confounding , 2013, Statistics in medicine.

[8]  Patrick Royston,et al.  The design of simulation studies in medical statistics , 2006, Statistics in medicine.

[9]  Sherri Rose,et al.  Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies , 2017, American journal of epidemiology.

[10]  Mark J van der Laan,et al.  History-adjusted marginal structural models for estimating time-varying effect modification. , 2007, American journal of epidemiology.

[11]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[12]  S. Stanojevic,et al.  Data Resource Profile: The UK Cystic Fibrosis Registry , 2017, International journal of epidemiology.

[13]  Stijn Vansteelandt,et al.  Structural nested models and G-estimation: the partially realized promise , 2014, 1503.01589.

[14]  Sherri Rose,et al.  Implementation of G-computation on a simulated data set: demonstration of a causal inference technique. , 2011, American journal of epidemiology.

[15]  Stephen R Cole,et al.  Constructing inverse probability weights for marginal structural models. , 2008, American journal of epidemiology.

[16]  J. Gustafson,et al.  Cystic Fibrosis , 2009, Journal of the Iowa Medical Society.

[17]  James M. Robins,et al.  Estimation of Effects of Sequential Treatments by Reparameterizing Directed Acyclic Graphs , 1997, UAI.

[18]  S. Vansteelandt,et al.  Revisiting g-estimation of the Effect of a Time-varying Exposure Subject to Time-varying Confounding , 2016 .

[19]  S. Nolan,et al.  Dornase alfa for cystic fibrosis. , 2017, Paediatric respiratory reviews.

[20]  S. Cole,et al.  Estimating the Effect of Cumulative Occupational Asbestos Exposure on Time to Lung Cancer Mortality: Using Structural Nested Failure-Time Models to Account for Healthy-Worker Survivor Bias , 2014, Epidemiology.

[21]  Eric B. Laber,et al.  Dynamic treatment regimes: Technical challenges and applications , 2014 .

[22]  J. Robins,et al.  G-Estimation of the Effect of Prophylaxis Therapy for Pneumocystis carinii Pneumonia on the Survival of AIDS Patients , 1992, Epidemiology.

[23]  M. J. van der Laan Targeted Maximum Likelihood Based Causal Inference: Part I , 2010, The international journal of biostatistics.

[24]  M. Petersen,et al.  Diagnosing Bias in the Inverse Probability of Treatment Weighted Estimator Resulting from Violation of Experimental Treatment Assignment , 2006 .

[25]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[26]  Jessica G. Young,et al.  Simulation from a known Cox MSM using standard parametric models for the g‐formula , 2014, Statistics in medicine.

[27]  Romain Neugebauer,et al.  Targeted learning in real‐world comparative effectiveness research with time‐varying interventions , 2014, Statistics in medicine.

[28]  H. Fuchs,et al.  Effect of aerosolized recombinant human DNase on exacerbations of respiratory symptoms and on pulmonary function in patients with cystic fibrosis. The Pulmozyme Study Group. , 1994, The New England journal of medicine.

[29]  S. Vansteelandt,et al.  Analysis of Longitudinal Studies With Repeated Outcome Measures: Adjusting for Time-Dependent Confounding Using Conventional Methods , 2017, American journal of epidemiology.