Discussion of PENCOMP

We would like to congratulate Zhou, Elliott, and Little (henceforth referred to as ZEL) on an interesting article in an important area of research. ZEL propose a new approach for causal inference using a full probability model that extends their earlier work for multiple imputation of missing data (Zhang and Little 2009). In addition, they extend their approach to more complex causal inference settings with time varying treatments and confounders. The proposed approach has numerous attractive features including it being relatively simple to implement, and that under “standard” assumptions it has the double robustness property like many semiparametric, moment-based estimators (Bang and Robins 2005). Given that the approach uses a full probability model (not just moments), attractive features of Bayesian inference for quantifying uncertainty about uncheckable assumptions (like no unmeasured confounding) can be part of the inference (Roy, Lum, and Daniels 2017; Kim et al. 2017). On the other hand the proposed approach has some limitations. First, it is “hard coded” to average causal effects and would require different models for other causal effects (e.g., quantile causal effects); of course, semiparametric approaches share the same limitation. However, it is not clear how straightforward it would be to extend the proposed approach to other causal effects. In Section 3.2, we discuss approaches that allow estimation of any causal effects. Second, it would seem unlikely that either the propensity score or outcome regression model would be correct within the simple models considered in the manuscript, especially when considering multiple time points in the longitudinal setting when consistency relies on correctly specifying multiple models at once. Further, “searching” for more complex models (i.e., interactions, nonlinearities) can be challenging. In Section 3.1, we will discuss “default” approaches that would result in considerably more flexible (and realistic) models. The rest of this discussion, where we will address three main issues, is organized as follows. In Section 2, we discuss the practical utility of the double robustness property in longitudinal settings. In Section 3, we discuss generalizing ZEL with more flexible regression models and an alternative using Bayesian nonparametric (BNP) methods for the regression of the outcome on confounders (i.e., not the propensity score). In Section 4, we discuss the issue of overlap/positivity violations in

[1]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[2]  Michael J. Daniels,et al.  A framework for Bayesian nonparametric inference for causal effects of mediation , 2017, Biometrics.

[3]  Xiao-Li Meng,et al.  Multiple-Imputation Inferences with Uncongenial Sources of Input , 1994 .

[4]  A. Gelman,et al.  ON THE STATIONARY DISTRIBUTION OF ITERATIVE IMPUTATIONS , 2010, 1012.2902.

[5]  David B. Dunson,et al.  Improving prediction from dirichlet process mixtures via enrichment , 2014, J. Mach. Learn. Res..

[6]  Michael J Daniels,et al.  A Bayesian nonparametric approach to marginal structural models for point treatments and a continuous or survival outcome. , 2017, Biostatistics.

[7]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[8]  Joseph Kang,et al.  Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2958.

[9]  James R Carpenter,et al.  Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model , 2012, Statistical methods in medical research.

[10]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[11]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[12]  Michael J Daniels,et al.  A note on compatibility for inference with missing data in the presence of auxiliary covariates , 2018, Statistics in medicine.

[13]  Dandan Xu,et al.  A Bayesian nonparametric approach to causal inference on quantiles , 2018, Biometrics.

[14]  Sonia Petrone,et al.  An enriched conjugate prior for Bayesian nonparametric inference , 2011 .

[15]  H. Chipman,et al.  BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[16]  Radford M. Neal Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[17]  Michael J Daniels,et al.  Bayesian nonparametric generative models for causal inference with missing at random covariates , 2017, Biometrics.

[18]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[19]  Dandan Xu,et al.  Sequential BART for imputation of missing covariates. , 2016, Biostatistics.

[20]  Jennifer Hill,et al.  Automated versus Do-It-Yourself Methods for Causal Inference: Lessons Learned from a Data Analysis Competition , 2017, Statistical Science.

[21]  Guangyu Zhang,et al.  Extensions of the Penalized Spline of Propensity Prediction Method of Imputation , 2009, Biometrics.

[22]  Edward H. Kennedy Nonparametric Causal Effects Based on Incremental Propensity Score Interventions , 2017, Journal of the American Statistical Association.

[23]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .