Pattern-mixture models for multivariate incomplete data with covariates.

Pattern-mixture models stratify incomplete data by the pattern of missing values and formulate distinct models within each stratum. Pattern-mixture models are developed for analyzing a random sample on continuous variables y(1), y(2) when values of y(2) are nonrandomly missing. Methods for scalar y(1) and y(2) are here generalized to vector y(1) and y(2) with additional fixed covariates x. Parameters in these models are identified by alternative assumptions about the missing-data mechanism. Models may be underidentified (in which case additional assumptions are needed), just-identified, or overidentified. Maximum likelihood and Bayesian methods are developed for the latter two situations, using the EM and SEM algorithms, direct and interactive simulation methods. The methods are illustrated on a data set involving alternative dosage regimens for the treatment of schizophrenia using haloperidol and on a regression example. Sensitivity to alternative assumptions about the missing-data mechanism is assessed, and the new methods are compared with complete-case analysis and maximum likelihood for a probit selection model.

[1]  Xiao-Li Meng,et al.  Using EM to Obtain Asymptotic Variance-Covariance Matrices: The SEM Algorithm , 1991 .

[2]  Raymond J. Carroll,et al.  Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process , 1988 .

[3]  Donald B. Rubin,et al.  Multiple imputation in mixture models for nonignorable nonresponse with follow-ups , 1993 .

[4]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[5]  K. Bailey,et al.  Estimation and comparison of changes in the presence of informative right censoring: conditional linear model. , 1989, Biometrics.

[6]  Roderick J. A. Little Regression with Missing X's: A Review , 1992 .

[7]  Andrew L. Rukhin,et al.  Tools for statistical inference , 1991 .

[8]  B. Efron,et al.  Compliance as an Explanatory Variable in Clinical Trials , 1991 .

[9]  C H Brown,et al.  Protecting against nonrandomly missing data in longitudinal studies. , 1990, Biometrics.

[10]  Adrian F. M. Smith,et al.  Sampling-Based Approaches to Calculating Marginal Densities , 1990 .

[11]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  M. Kenward,et al.  Informative dropout in longitudinal data analysis (with discussion) , 1994 .

[14]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[15]  T. Amemiya Tobit models: A survey , 1984 .

[16]  Donald B. Rubin,et al.  Selection Modeling Versus Mixture Modeling with Nonignorable Nonresponse , 1986 .

[17]  R F Woolson,et al.  Slope estimation in the presence of informative right censoring: modeling the number of observations as a geometric random variable. , 1994, Biometrics.

[18]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[19]  R. Little Models for Nonresponse in Sample Surveys , 1982 .

[20]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[21]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .