Quantile Regression Methods for Longitudinal Data with Drop‐outs: Application to CD4 Cell Counts of Patients Infected with the Human Immunodeficiency Virus

Patients infected with the human immunodeficiency virus (HIV) generally experience a decline in their CD4 cell count (a count of certain white blood cells). We describe the use of quantile regression methods to analyse longitudinal data on CD4 cell counts from 1300 patients who participated in clinical trials that compared two therapeutic treatments: zidovudine and didanosine. It is of scientific interest to determine any treatment differences in the CD4 cell counts over a short treatment period. However, the analysis of the CD4 data is complicated by drop‐outs: patients with lower CD4 cell counts at the base‐line appear more likely to drop out at later measurement occasions. Motivated by this example, we describe the use of `weighted' estimating equations in quantile regression models for longitudinal data with drop‐outs. In particular, the conventional estimating equations for the quantile regression parameters are weighted inversely proportionally to the probability of drop‐out. This approach requires the process generating the missing data to be estimable but makes no assumptions about the distribution of the responses other than those imposed by the quantile regression model. This method yields consistent estimates of the quantile regression parameters provided that the model for drop‐out has been correctly specified. The methodology proposed is applied to the CD4 cell count data and the results are compared with those obtained from an `unweighted' analysis. These results demonstrate how an analysis that fails to account for drop‐outs can mislead.

[1]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[2]  Roger Koenker,et al.  An empirical quantile function for linear models with iid errors , 1981 .

[3]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[4]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[5]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[6]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[7]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[8]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[9]  Gary Chamberlain,et al.  QUANTILE REGRESSION, CENSORING, AND THE STRUCTURE OF WAGES , 1991 .

[10]  Andrea Rotnitzky,et al.  Regression Models for Discrete Longitudinal Responses , 1993 .

[11]  N. Laird,et al.  [Regression Models for Discrete Longitudinal Responses]: Rejoinder , 1993 .

[12]  J. Neuhaus Estimation efficiency and tests of covariate effects with clustered binary data. , 1993, Biometrics.

[13]  S. Lipsitz,et al.  Performance of generalized estimating equations in practical situations. , 1994, Biometrics.

[14]  Moshe Buchinsky CHANGES IN THE U.S. WAGE STRUCTURE 1963-1987: APPLICATION OF QUANTILE REGRESSION , 1994 .

[15]  G. Fitzmaurice,et al.  A caveat concerning independence estimating equations with multivariate binary data. , 1995, Biometrics.

[16]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[17]  Moshe Buchinsky Estimating the asymptotic covariance matrix for quantile regression models a Monte Carlo study , 1995 .

[18]  Sin-Ho Jung Quasi-Likelihood for Median Regression Models , 1996 .

[19]  G Molenberghs,et al.  Patterns of opportunistic infections in patients with HIV infection. , 1996, Journal of acquired immune deficiency syndromes and human retrovirology : official publication of the International Retrovirology Association.