A Bivariate Pseudo-Likelihood for Incomplete Longitudinal Binary Data with Nonignorable Non-monotone Missingness

For analyzing longitudinal binary data with nonignorable and non-monotone missing responses, a full likelihood method is complicated algebraically, and often requires intensive computation, especially when there are many follow-up times. As an alternative, a pseudo-likelihood approach has been proposed in the literature under minimal parametric assumptions. This formulation only requires specification of the marginal distributions of the responses and missing data mechanism, and uses an independence working assumption. However, this estimator can be inefficient for estimating both time-varying and time-stationary effects under moderate to strong within-subject associations among repeated responses. In this article, we propose an alternative estimator, based on a bivariate pseudo-likelihood, and demonstrate in simulations that the proposed method can be much more efficient than the previous pseudo-likelihood obtained under the assumption of independence. We illustrate the method using longitudinal data on CD4 counts from two clinical trials of HIV-infected patients.

[1]  C. Varin,et al.  A note on composite likelihood inference and model selection , 2005 .

[2]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[3]  S. Lipsitz,et al.  Missing responses in generalised linear mixed models when the missing data mechanism is nonignorable , 2001 .

[4]  G Molenberghs,et al.  Patterns of opportunistic infections in patients with HIV infection. , 1996, Journal of acquired immune deficiency syndromes and human retrovirology : official publication of the International Retrovirology Association.

[5]  S G Baker,et al.  Marginal regression for repeated binary data with outcome subject to non-ignorable non-response. , 1995, Biometrics.

[6]  S. G. Meester,et al.  A parametric model for cluster correlated categorical data. , 1994, Biometrics.

[7]  G. Molenberghs,et al.  Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett Distribution , 1994 .

[8]  J. C. van Houwelingen,et al.  Logistic Regression for Correlated Binary Data , 1994 .

[9]  R. Chaisson,et al.  Incidence and natural history of cytomegalovirus disease in patients with advanced human immunodeficiency virus disease treated with zidovudine. The Zidovudine Epidemiology Study Group. , 1992, The Journal of infectious diseases.

[10]  T. Merigan,et al.  A controlled trial comparing continued zidovudine with didanosine in human immunodeficiency virus infection. The NIAID AIDS Clinical Trials Group. , 1992, The New England journal of medicine.

[11]  S. Lipsitz,et al.  Generalized estimating equations for correlated binary data: Using the odds ratio as a measure of association , 1991 .

[12]  L. Zhao,et al.  Correlated binary regression using a quadratic exponential model , 1990 .

[13]  J. Goedert,et al.  The risk of Pneumocystis carinii pneumonia. , 1990, The New England journal of medicine.

[14]  Nan M. Laird,et al.  Regression Analysis for Categorical Variables with Outcome Subject to Nonignorable Nonresponse , 1988 .

[15]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[16]  Gail Gong,et al.  Pseudo Maximum Likelihood Estimation: Theory and Applications , 1981 .

[17]  D. Cox The Analysis of Multivariate Binary Data , 1972 .

[18]  R. Plackett A Class of Bivariate Distributions , 1965 .

[19]  Harry Joe,et al.  Composite Likelihood Methods , 2012 .

[20]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .