A Semiparametric Approach for Analyzing Nonignorable Missing Data

In missing data analysis, there is often a need to assess the sensitivity of key inferences to departures from untestable assumptions regarding the missing data process. Such sensitivity analysis often requires specifying a missing data model which commonly assumes parametric functional forms for the predictors of missingness. In this paper, we relax the parametric assumption and investigate the use of a generalized additive missing data model. We also consider the possibility of a non-linear relationship between missingness and the potentially missing outcome, whereas the existing literature commonly assumes a more restricted linear relationship. To avoid the computational complexity, we adopt an index approach for local sensitivity. We derive explicit formulas for the resulting semiparametric sensitivity index. The computation of the index is simple and completely avoids the need to repeatedly fit the semiparametric nonignorable model. Only estimates from the standard software analysis are required with a moderate amount of additional computation. Thus, the semiparametric index provides a fast and robust method to adjust the standard estimates for nonignorable missingness. An extensive simulation study is conducted to evaluate the effects of misspecifying the missing data model and to compare the performance of the proposed approach with the commonly used parametric approaches. The simulation study shows that the proposed method helps reduce bias that might arise from the misspecification of the functional forms of predictors in the missing data model. We illustrate the method in a Wage Offer dataset.

[1]  A. Troxel,et al.  AN INDEX OF LOCAL SENSITIVITY TO NONIGNORABILITY , 2004 .

[2]  Yi Qian,et al.  Do National Patent Laws Stimulate Domestic Innovation in a Global Patenting Environment? A Cross-Country Analysis of Pharmaceutical Patent Protection, 19782002 , 2007, The Review of Economics and Statistics.

[3]  Hui Xie,et al.  Local Sensitivity to Nonignorability: Dependence on the Assumed Dropout Mechanism , 2009 .

[4]  Daniel F Heitjan,et al.  Sensitivity analysis of causal inference in a clinical trial subject to crossover , 2004, Clinical trials.

[5]  J. Copas,et al.  Local sensitivity approximations for selectivity bias , 2001 .

[6]  Daniel F Heitjan,et al.  An index of local sensitivity to nonignorable drop‐out in longitudinal modelling , 2005, Statistics in medicine.

[7]  Donald B. Rubin,et al.  Combining Panel Data Sets with Attrition and Refreshment Samples , 1998 .

[8]  J. Lafferty,et al.  Sparse additive models , 2007, 0711.4555.

[9]  D. Rubin,et al.  Ignorability and Coarse Data , 1991 .

[10]  Hui Xie Bayesian inference from incomplete longitudinal data: a simple method to quantify sensitivity to nonignorable dropout. , 2009, Statistics in medicine.

[11]  T. Mroz,et al.  The Sensitivity of an Empirical Model of Married Women's Hours of Work to Economic and Statistical Assumptions , 1987 .

[12]  Stuart R. Lipsitz,et al.  Analysis of longitudinal data with non‐ignorable non‐monotone missing values , 2002 .

[13]  J. Ibrahim,et al.  Semiparametric Models for Missing Covariate and Response Data in Regression Models , 2006, Biometrics.

[14]  Daniel F Heitjan,et al.  A Simple Local Sensitivity Analysis Tool for Nonignorable Coarsening: Application to Dependent Censoring , 2006, Biometrics.

[15]  Daniel F Heitjan,et al.  Impact of nonignorable coarsening on Bayesian inference. , 2006, Biostatistics.

[16]  W Vach,et al.  Logistic regression with incompletely observed categorical covariates--investigating the sensitivity against violation of the missing at random assumption. , 1995, Statistics in medicine.

[17]  Hui Xie,et al.  A local sensitivity analysis approach to longitudinal non‐Gaussian data with non‐ignorable dropout , 2008, Statistics in medicine.

[18]  D. O. Scharfstein Adjusting for nonignorable dropout using semiparametric nonresponse models (with discussion) , 1999 .

[19]  Yi Qian,et al.  Measuring the Impact of Nonignorability in Panel Data with Non-Monotone Nonresponse , 2012 .

[20]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[21]  A B Troxel,et al.  A comparative analysis of quality of life data from a Southwest Oncology Group randomized trial of advanced colorectal cancer. , 1998, Statistics in medicine.

[22]  M. Kenward Selection models for repeated measurements with non-random dropout: an illustration of sensitivity. , 1998, Statistics in medicine.

[23]  J. Copas,et al.  Inference for Non‐random Samples , 1997 .

[24]  G Molenberghs,et al.  Sensitivity Analysis for Nonrandom Dropout: A Local Influence Approach , 2001, Biometrics.