Identification of Causal Effects Within Principal Strata Using Auxiliary Variables

In causal inference, principal stratification is a framework for dealing with a posttreatment intermediate variable between a treatment and an outcome, in which the principal strata are defined by the joint potential values of the intermediate variable. Because the principal strata are not fully observable, the causal effects within them, also known as the principal causal effects, are not identifiable without additional assumptions. Several previous empirical studies leveraged auxiliary variables to improve the inference of principal causal effects. We establish a general theory for identification and estimation of the principal causal effects with auxiliary variables, which provides a solid foundation for statistical inference and more insights for model building in empirical research. In particular, we consider two commonly-used strategies for principal stratification problems: principal ignorability, and the conditional independence between the auxiliary variable and the outcome given principal strata and covariates. For these two strategies, we give non-parametric and semi-parametric identification results without modeling assumptions on the outcome. When the assumptions for neither strategies are plausible, we propose a large class of flexible parametric and semi-parametric models for identifying principal causal effects. Our theory not only ensures formal identification results of several models that have been used in previous empirical studies but also generalizes them to allow for different types of outcomes and intermediate variables.

[1]  Yingyao Hu,et al.  NONPARAMETRIC IDENTIFICATION USING INSTRUMENTAL VARIABLES: SUFFICIENT CONDITIONS FOR COMPLETENESS , 2011, Econometric Theory.

[2]  Dylan S. Small,et al.  Bounds on causal effects in three‐arm trials with non‐compliance , 2006 .

[3]  Chuanhai Liu Robit Regression: A Simple Robust Alternative to Logistic and Probit Regression , 2005 .

[4]  Donald B. Rubin,et al.  Estimation of Causal Effects via Principal Stratification When Some Outcomes are Truncated by “Death” , 2003 .

[5]  Erin E Gabriel,et al.  Augmented trial designs for evaluation of principal surrogates. , 2016, Biostatistics.

[6]  Dean Follmann,et al.  Augmented Designs to Assess Immune Response in Vaccine Trials , 2006, Biometrics.

[7]  F. Mealli,et al.  Augmented designs to assess principal strata direct effects , 2011 .

[8]  A. Zellner Bayesian and Non-Bayesian Analysis of the Regression Model with Multivariate Student- t Error Terms , 1976 .

[9]  T. VanderWeele Simple relations between principal stratification and direct and indirect effects , 2008 .

[10]  P. Rosenbaum The Consequences of Adjustment for a Concomitant Variable that Has Been Affected by the Treatment , 1984 .

[11]  Jiannan Lu,et al.  Principal stratification analysis using principal scores , 2016, 1602.01196.

[12]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[13]  D. Rubin,et al.  Assessing the effect of an influenza vaccine in an encouragement design. , 2000, Biostatistics.

[14]  Michael R Elliott,et al.  Bayesian inference for causal mediation effects using principal stratification with dichotomous mediators and outcomes. , 2010, Biostatistics.

[15]  A. Cohen,et al.  Finite Mixture Distributions , 1982 .

[16]  Kosuke Imai,et al.  mediation: R Package for Causal Mediation Analysis , 2014 .

[17]  Elizabeth A Stuart,et al.  Assessing the sensitivity of methods for estimating principal causal effects , 2015, Statistical methods in medical research.

[18]  B. Efron,et al.  Compliance as an Explanatory Variable in Clinical Trials , 1991 .

[19]  Paul Gustafson,et al.  What Are the Limits of Posterior Distributions Arising From Nonidentified Models, and Why Should We Care? , 2009 .

[20]  Joseph W Hogan,et al.  Principal stratification with predictors of compliance for randomized trials with 2 active treatments. , 2008, Biostatistics.

[21]  Luke W. Miratrix,et al.  Weak Separation in Mixture Models and Implications for Principal Stratification , 2016, AISTATS.

[22]  R. Gallop,et al.  Mediation analysis with principal stratification , 2009, Statistics in medicine.

[23]  Donald B. Rubin,et al.  Evaluating the Effect of Training on Wages in the Presence of Noncompliance, Nonemployment, and Missing Outcome Data , 2012 .

[24]  P. Bickel,et al.  Mathematical Statistics: Basic Ideas and Selected Topics , 1977 .

[25]  Judea Pearl,et al.  Letter to the Editor: Remarks on the Method of Propensity Score , 2009 .

[26]  D. Rubin,et al.  Principal Stratification in Causal Inference , 2002, Biometrics.

[27]  A. Vinokur,et al.  Impact of the JOBS intervention on unemployed workers varying in risk for depression , 1995, American journal of community psychology.

[28]  Donald B Rubin,et al.  Principal Stratification for Causal Inference With Extended Partial Compliance , 2008 .

[29]  Using survival information in truncation by death problems without the monotonicity assumption , 2018, Biometrics.

[30]  M. Hudgens,et al.  Assessing Vaccine Effects in Repeated Low‐Dose Challenge Experiments , 2009, Biometrics.

[31]  L. Keele,et al.  A General Approach to Causal Mediation Analysis , 2010, Psychological methods.

[32]  Booil Jo,et al.  Causal inference in randomized experiments with mediational processes. , 2008, Psychological methods.

[33]  Elizabeth A Stuart,et al.  On the use of propensity scores in principal causal effect estimation , 2009, Statistics in medicine.

[34]  P. Ding,et al.  Principal ignorability in mediation analysis: through and beyond sequential ignorability , 2018, Biometrika.

[35]  Fabrizia Mealli,et al.  Identification of Principal Causal Effects Using Additional Outcomes in Concentration Graphs , 2016 .

[36]  S. Zeger,et al.  On estimating efficacy from clinical trials. , 1991, Statistics in medicine.

[37]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[38]  Francesco Bartolucci,et al.  Modeling Partial Compliance Through Copulas in a Principal Stratification Framework , 2011 .

[39]  Dean A. Follmann,et al.  On the Effect of Treatment among Would-Be Treatment Compliers: An Analysis of the Multiple Risk Factor Intervention Trial , 2000 .

[40]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[41]  D. Rubin Causal Inference Through Potential Outcomes and Principal Stratification: Application to Studies with “Censoring” Due to Death , 2006, math/0612783.

[42]  R. Little,et al.  Statistical Techniques for Analyzing Data from Prevention Trials: Treatment of No-Shows Using Rubin's Causal Model , 1998 .

[43]  Dylan Small,et al.  Defining and Estimating Intervention Effects for Groups that will Develop an Auxiliary Outcome , 2007 .

[44]  Fan Li,et al.  Causal Inference: A Missing Data Perspective , 2017, 1712.06170.

[45]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: A General Method for Estimating Sampling Variances for Standard Estimators for Average Causal Effects , 2015 .

[46]  Peter B Gilbert,et al.  Comparing Biomarkers as Principal Surrogate Endpoints , 2011, Biometrics.

[47]  Thomas S. Richardson,et al.  Identification and estimation of causal effects with outcomes truncated by death , 2016, Biometrika.

[48]  A. Vinokur,et al.  Mastery and inoculation against setbacks as active ingredients in the JOBS intervention for the unemployed. , 1997, Journal of consulting and clinical psychology.

[49]  Peng Ding,et al.  On the Conditional Distribution of the Multivariate t Distribution , 2016, 1604.00561.

[50]  Jeremy MG Taylor,et al.  Robust Statistical Modeling Using the t Distribution , 1989 .

[51]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[52]  Joseph W Hogan,et al.  Bayesian Inference for the Causal Effect of Mediation , 2012, Biometrics.

[53]  W. G. Cochran Analysis of covariance: Its nature and uses. , 1957 .

[54]  Donald B. Rubin,et al.  Likelihood-Based Analysis of Causal Effects of Job-Training Programs Using Principal Stratification , 2009 .

[55]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[56]  Fan Li,et al.  A Bayesian Semiparametric Approach to Intermediate Variables in Causal Inference , 2011 .

[57]  Corwin M Zigler,et al.  Health effects of power plant emissions through ambient air quality , 2020, Journal of the Royal Statistical Society: Series A (Statistics in Society).

[58]  Kjell A. Doksum,et al.  Mathematical Statistics: Basic Ideas and Selected Topics, Volume I, Second Edition , 2015 .

[59]  Kosuke Imai,et al.  Sharp bounds on the causal effects in randomized experiments with "truncation-by-death" , 2008 .

[60]  Myron Melnyk,et al.  Principles of applied statistics , 1974 .

[61]  Michael G Hudgens,et al.  Evaluating Candidate Principal Surrogate Endpoints , 2008, Biometrics.

[62]  Fabrizia Mealli,et al.  Using Secondary Outcomes to Sharpen Inference in Randomized Experiments With Noncompliance , 2013 .

[63]  Jin Tian,et al.  On the Identification of Causal Effects , 2015 .

[64]  Donald B. Rubin,et al.  The fragility of standard inferential approaches in principal stratification models relative to direct likelihood approaches , 2016, Stat. Anal. Data Min..

[65]  Peter B Gilbert,et al.  ASSESSING SURROGATE ENDPOINTS IN VACCINE TRIALS WITH CASE-COHORT SAMPLING AND THE COX MODEL. , 2008, The annals of applied statistics.

[66]  M R Elliott,et al.  Surrogacy assessment using principal stratification and a Gaussian copula model , 2017, Statistical methods in medical research.

[67]  Corwin M Zigler,et al.  A Bayesian Approach to Improved Estimation of Causal Effect Predictiveness for a Principal Surrogate Endpoint , 2012, Biometrics.

[68]  E. Stuart,et al.  The Use of Propensity Scores in Mediation Analysis , 2011, Multivariate behavioral research.

[69]  Zhi Geng,et al.  Identifiability and Estimation of Causal Effects by Principal Stratification With Outcomes Truncated by Death , 2011 .

[70]  Zhi Geng,et al.  Principal causal effect identification and surrogate end point evaluation by multiple trials , 2015, 1507.05935.

[71]  Luke W. Miratrix,et al.  Identifying and estimating principal causal effects in a multi-site trial of Early College High Schools , 2019, The Annals of Applied Statistics.

[72]  Paul Gustafson,et al.  Bayesian Inference for Partially Identified Models: Exploring the Limits of Limited Data , 2015 .