Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Missing data are frequently encountered in the statistical analysis of randomized experiments. In this paper, I propose statistical methods that can be used to analyze randomized experiments with a nonignorable missing binary outcome where the missing-data mechanism may depend on the unobserved values of the outcome variable itself. I first introduce an identification strategy for the average treatment effect and compare it with the existing alternative approaches in the literature. I then derive the maximum likelihood estimator and its asymptotic properties, and discuss possible estimation methods. Furthermore, since the proposed identification assumption is not directly verifiable from the data, I show how to conduct a sensitivity analysis based on the parameterization that links the key identification assumption with the causal quantities of interest. Then, the proposed methodology is extended to the analysis of randomized experiments with noncompliance. Although the method introduced in this paper may not directly apply to randomized experiments with non-binary outcomes, I briefly discuss possible identification strategies in more general situations. Finally, I apply the proposed methodology to analyze data from the German election experiment and the influenza vaccination study, which originally motivated the methodological problems addressed in this paper.

[1]  Donald B. Rubin,et al.  Principal Stratification Approach to Broken Randomized Experiments , 2003 .

[2]  G. Imbens Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review , 2004 .

[3]  C J McDonald,et al.  Effects of computer reminders for influenza vaccination on morbidity during influenza epidemics. , 1992, M.D. computing : computers in medical practice.

[4]  Xiao-Hua Zhou,et al.  ITT analysis of randomized encouragement design studies with missing data , 2006, Statistics in medicine.

[5]  R. Little,et al.  Inference for the Complier-Average Causal Effect From Longitudinal Data Subject to Noncompliance and Missing Data, With Application to a Job Training Assessment for the Unemployed , 2001 .

[6]  Barry C. Burden Voter Turnout and the National Election Studies , 2000, Political Analysis.

[7]  A James O'Malley,et al.  Likelihood Methods for Treatment Noncompliance and Subsequent Nonresponse in Randomized Trials , 2005, Biometrics.

[8]  G. W. Imbens Sensitivity to Exogeneity Assumptions in Program Evaluation , 2003 .

[9]  G. Imbens,et al.  Analyzing a randomized trial on breast self-examination with noncompliance and missing outcomes. , 2004, Biostatistics.

[10]  D. Rubin,et al.  Assessing Sensitivity to an Unobserved Binary Covariate in an Observational Study with Binary Outcome , 1983 .

[11]  C. McDonald,et al.  Reminders to physicians from an introspective computer medical record. A two-year randomized trial. , 1984, Annals of internal medicine.

[12]  D. Rubin,et al.  Addressing complications of intention-to-treat analysis in the combined presence of all-or-none treatment-noncompliance and subsequent missing outcomes , 1999 .

[13]  David R. Cox Planning of Experiments , 1958 .

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[16]  P. Rosenbaum Sensitivity analysis for certain permutation inferences in matched observational studies , 1987 .

[17]  Xiao-Hua Zhou,et al.  Clustered encouragement designs with individual noncompliance: bayesian inference with randomization, and application to advance directive forms. , 2002, Biostatistics.

[18]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[19]  J. Horowitz,et al.  Nonparametric Analysis of Randomized Experiments with Missing Covariate and Outcome Data , 2000 .

[20]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[21]  D. Rubin,et al.  Assessing the effect of an influenza vaccine in an encouragement design. , 2000, Biostatistics.

[22]  Donald B. Rubin,et al.  Combining Panel Data Sets with Attrition and Refreshment Samples , 1998 .

[23]  R. Kronmal,et al.  Assessing the sensitivity of regression results to unmeasured confounders in observational studies. , 1998, Biometrics.

[24]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[25]  D. Clayton,et al.  Statistical Models in Epidemiology , 1993 .