Multivariate logistic regression with incomplete covariate and auxiliary information

In this article, we propose and explore a multivariate logistic regression model for analyzing multiple binary outcomes with incomplete covariate data where auxiliary information is available. The auxiliary data are extraneous to the regression model of interest but predictive of the covariate with missing data. describe how the auxiliary information can be incorporated into a regression model for a single binary outcome with missing covariates, and hence the efficiency of the regression estimators can be improved. We consider extending the method of Horton and Laird (2001) to the case of a multivariate logistic regression model for multiple correlated outcomes, and with missing covariates and completely observed auxiliary information. We demonstrate that in the case of moderate to strong associations among the multiple outcomes, one can achieve considerable gains in efficiency from estimators in a multivariate model as compared to the marginal estimators of the same parameters.

[1]  John W. McDonald,et al.  Marginal regression analysis of a multivariate binary response , 1995 .

[2]  J. Ibrahim Incomplete Data in Generalized Linear Models , 1990 .

[3]  Monica A. Walker,et al.  Studies in Item Analysis and Prediction. , 1962 .

[4]  G. Molenberghs,et al.  Likelihood and quasi-likelihood based methods for analysing multivariate categorical data, with the association between outcomes of interest , 1996 .

[5]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[6]  M P Becker,et al.  Marginal modeling of binary cross-over data. , 1993, Biometrics.

[7]  A. Agresti,et al.  Simultaneously Modeling Joint and Marginal Distributions of Multivariate Categorical Responses , 1994 .

[8]  G. Molenberghs,et al.  Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett Distribution , 1994 .

[9]  S. Zeger,et al.  Multivariate Regression Analyses for Categorical Data , 1992 .

[10]  N M Laird,et al.  Maximum likelihood regression methods for paired binary data. , 1990, Statistics in medicine.

[11]  S W Lagakos,et al.  Adjusting for early treatment termination in comparative clinical trials. , 1990, Statistics in medicine.

[12]  G. Glonek A class of regression models for multivariate categorical responses , 1996 .

[13]  N M Laird,et al.  Maximum Likelihood Analysis of Logistic Regression Models with Incomplete Covariate Data and Auxiliary Information , 2001, Biometrics.

[14]  G Molenberghs,et al.  Methods for analyzing multivariate binary data, with association between outcomes of interest. , 1996, Biometrics.

[15]  P. McCullagh,et al.  Multivariate Logistic Models , 1995 .

[16]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[17]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[18]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[19]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[20]  N. Laird,et al.  A likelihood-based method for analysing longitudinal binary responses , 1993 .

[21]  J. Dale,et al.  Local versus global association for bivariate ordered responses , 1984 .