Every missingness not at random model has a missingness at random counterpart with equal fit

Summary.  Over the last decade a variety of models to analyse incomplete multivariate and longitudinal data have been proposed, many of which allowing for the missingness to be not at random, in the sense that the unobserved measurements influence the process governing missingness, in addition to influences coming from observed measurements and/or covariates. The fundamental problems that are implied by such models, to which we refer as sensitivity to unverifiable modelling assumptions, has, in turn, sparked off various strands of research in what is now termed sensitivity analysis. The nature of sensitivity originates from the fact that a missingness not at random (MNAR) model is not fully verifiable from the data, rendering the empirical distinction between MNAR and missingness at random (MAR), where only covariates and observed outcomes influence missingness, difficult or even impossible, unless we are willing to accept the posited MNAR model in an unquestioning way. We show that the empirical distinction between MAR and MNAR is not possible, in the sense that each MNAR model fit to a set of observed data can be reproduced exactly by an MAR counterpart. Of course, such a pair of models will produce different predictions of the unobserved outcomes, given the observed outcomes. Theoretical considerations are supplemented with an illustration that is based on the Slovenian public opinion survey, which has been analysed before in the context of sensitivity analysis.

[1]  S. Fienberg,et al.  Two-Dimensional Contingency Tables with Both Completely and Partially Cross-Classified Data , 1974 .

[2]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[3]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[4]  Donald B. Rubin,et al.  Selection Modeling Versus Mixture Modeling with Nonignorable Nonresponse , 1986 .

[5]  Donald B. Rubin,et al.  Selection modelling versus mixture modelling with nonignorable nonresponse , 1986 .

[6]  Raymond J. Carroll,et al.  Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process , 1988 .

[7]  K. Bailey,et al.  Analysing changes in the presence of informative right censoring caused by death and withdrawal. , 1988, Statistics in medicine.

[8]  K. Bailey,et al.  Estimation and comparison of changes in the presence of informative right censoring: conditional linear model. , 1989, Biometrics.

[9]  W F Rosenberger,et al.  Closed-form estimates for missing counts in two-way contingency tables. , 1992, Statistics in medicine.

[10]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[11]  Geert Molenberghs,et al.  Discussion of Diggle, P. and Kenward, M. G.: 'Informative drop-out in longitudinal data analysis' , 1994 .

[12]  Roderick J. A. Little,et al.  A Class of Pattern-Mixture Models for Normal Incomplete Data , 1994 .

[13]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[14]  M. Kenward,et al.  Informative dropout in longitudinal data analysis (with discussion) , 1994 .

[15]  P. J. Diggle Discussion to Diggle, P. J. and Kenward, M. G. : Informative drop-out in longitudinal data analysis , 1994 .

[16]  D. Rubin,et al.  Handling “Don't Know” Survey Responses: The Case of the Slovenian Plebiscite , 1995 .

[17]  D. Follmann,et al.  An approximate generalized linear model with random effects for informative missing data. , 1995, Biometrics.

[18]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[19]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[20]  S G Baker,et al.  Marginal regression for repeated binary data with outcome subject to non-ignorable non-response. , 1995, Biometrics.

[21]  James M. Robins,et al.  Coarsening at Random: Characterizations, Conjectures, Counter-Examples , 1997 .

[22]  M. Kenward,et al.  The analysis of longitudinal ordinal data with nonrandom drop-out , 1997 .

[23]  M. Kenward Selection models for repeated measurements with non-random dropout: an illustration of sensitivity. , 1998, Statistics in medicine.

[24]  Geert Molenberghs,et al.  Monotone missing data and pattern‐mixture models , 1998 .

[25]  J R Landis,et al.  Mixed effects logistic regression models for longitudinal binary response data with informative drop-out. , 1998, Biometrics.

[26]  James M. Robins,et al.  Semiparametric Regression for Repeated Outcomes With Nonignorable Nonresponse , 1998 .

[27]  Yuichiro Kanazawa,et al.  Handling "Don't know" survey responses : the case of Japanese voters on party support , 2000 .

[28]  Geert Molenberghs,et al.  A local influence approach to sensitivity analysis of incomplete longitudinal ordinal data , 2001 .

[29]  Ana Ivelisse Avilés,et al.  Linear Mixed Models for Longitudinal Data , 2001, Technometrics.

[30]  G Molenberghs,et al.  Sensitivity Analysis for Nonrandom Dropout: A Local Influence Approach , 2001, Biometrics.

[31]  G. Molenberghs,et al.  Linear Mixed Models for Longitudinal Data , 2001 .

[32]  M. Kenward,et al.  Sensitivity analysis for incomplete contingency tables: the Slovenian plebiscite case , 2001 .

[33]  Hulin Wu,et al.  Design of viral dynamic studies for efficiently assessing potency of anti-HIV therapies in AIDS Clinical Trials , 2002 .

[34]  Stuart R. Lipsitz,et al.  Analysis of longitudinal data with non‐ignorable non‐monotone missing values , 2002 .

[35]  Geert Molenberghs,et al.  Strategies to fit pattern-mixture models. , 2002, Biostatistics.

[36]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[37]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[38]  Geert Molenberghs,et al.  A Local Influence Approach Applied to Binary Data from a Psychiatric Study , 2003, Biometrics.

[39]  Geert Molenberghs,et al.  Pattern‐mixture models with proper time dependence , 2003 .

[40]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[41]  Geert Molenberghs,et al.  Analyzing incomplete longitudinal clinical trial data. , 2004, Biostatistics.

[42]  G. Molenberghs,et al.  Models for Discrete Longitudinal Data , 2005 .

[43]  Geert Molenberghs,et al.  The nature of sensitivity in monotone missing not at random models , 2006, Comput. Stat. Data Anal..

[44]  L. Hunt,et al.  Missing Data in Clinical Studies , 2007 .

[45]  Corrigendum: A multifaceted sensitivity analysis of the Slovenian public opinion survey data , 2009 .