Pattern–mixture and selection models for analysing longitudinal data with monotone missing patterns

We examine three pattern-mixture models for making inference about parameters of the distribution of an outcome of interest "Y" that is to be measured at the end of a longitudinal study when this outcome is missing in some subjects. We show that these pattern-mixture models also have an interpretation as selection models. Because these models make unverifiable assumptions, we recommend that inference about the distribution of "Y" be repeated under a range of plausible assumptions. We argue that, of the three models considered, only one admits a parameterization that facilitates the examination of departures from the assumption of sequential ignorability. The three models are nonparametric in the sense that they do not impose restrictions on the class of observed data distributions. Owing to the curse of dimensionality, the assumptions that are encoded in these models are sufficient for identification but not for inference. We describe additional flexible and easily interpretable assumptions under which it is possible to construct estimators that are well behaved with moderate sample sizes. These assumptions define semiparametric models for the distribution of the observed data. We describe a class of estimators which, up to asymptotic equivalence, comprise all the consistent and asymptotically normal estimators of the parameters of interest under the postulated semiparametric models. We illustrate our methods with the analysis of data from a randomized clinical trial of contracepting women. Copyright 2003 Royal Statistical Society.

[1]  D. Rubin,et al.  Ignorability and Coarse Data , 1991 .

[2]  J. Robins,et al.  Correcting for Noncompliance and Dependent Censoring in an AIDS Clinical Trial with Inverse Probability of Censoring Weighted (IPCW) Log‐Rank Tests , 2000, Biometrics.

[3]  M J Campbell,et al.  Assessing changes in vaginal bleeding patterns in contracepting women. , 1988, Contraception.

[4]  L. Hansen Large Sample Properties of Generalized Method of Moments Estimators , 1982 .

[5]  W. Newey,et al.  16 Efficient estimation of models with conditional moment restrictions , 1993 .

[6]  K. Bailey,et al.  Estimation and comparison of changes in the presence of informative right censoring: conditional linear model. , 1989, Biometrics.

[7]  J. Robins,et al.  Sensitivity Analysis for Selection bias and unmeasured Confounding in missing Data and Causal inference models , 2000 .

[8]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[9]  Erik V. Nordheim,et al.  Inference from Nonrandomly Missing Categorical Data: An Example from a Genetic Study on Turner's Syndrome , 1984 .

[10]  J. Robins A graphical approach to the identification and estimation of causal parameters in mortality studies with sustained exposure periods. , 1987, Journal of chronic diseases.

[11]  Hulin Wu,et al.  Design of viral dynamic studies for efficiently assessing potency of anti-HIV therapies in AIDS Clinical Trials , 2002 .

[12]  N M Laird,et al.  Mixture models for the joint distribution of repeated measures and event times. , 1997, Statistics in medicine.

[13]  R. Little,et al.  Pattern-mixture models for multivariate incomplete data with covariates. , 1996, Biometrics.

[14]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[15]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[16]  Raymond J. Carroll,et al.  Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process , 1988 .

[17]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[18]  M. Kenward,et al.  The analysis of longitudinal ordinal data with nonrandom drop-out , 1997 .

[19]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[20]  D Scharfstein,et al.  Inference in Randomized Studies with Informative Censoring and Discrete Time‐to‐Event Endpoints , 2001, Biometrics.

[21]  D. Rubin,et al.  The Bias Due to Incomplete Matching , 1985 .

[22]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[23]  D Scharfstein,et al.  Methods for Conducting Sensitivity Analysis of Trials with Potentially Nonignorable Competing Causes of Censoring , 2001, Biometrics.

[24]  James M. Robins,et al.  Semiparametric Regression for Repeated Outcomes With Nonignorable Nonresponse , 1998 .

[25]  D. Follmann,et al.  An approximate generalized linear model with random effects for informative missing data. , 1995, Biometrics.

[26]  D. Rubin,et al.  Assessing Sensitivity to an Unobserved Binary Covariate in an Observational Study with Binary Outcome , 1983 .

[27]  Roderick J. A. Little,et al.  A Class of Pattern-Mixture Models for Normal Incomplete Data , 1994 .

[28]  Geert Molenberghs,et al.  Monotone missing data and pattern‐mixture models , 1998 .

[29]  Peter J. Diggle,et al.  Missing data mechanisms and pattern-mixture models , 1998 .

[30]  J. Robins,et al.  G-Estimation of the Effect of Prophylaxis Therapy for Pneumocystis carinii Pneumonia on the Survival of AIDS Patients , 1992, Epidemiology.