Impact of Missing Data Due to Selective Dropouts in Cohort Studies and Clinical Trials

Background. Many cohort studies and clinical trials use repeated measurements of laboratory markers to track disease progression and to evaluate new therapies. A major problem in the analysis of such studies is that marker data are censored in some patients owing to withdrawal, loss to follow-up, or death. The objective of this paper is to evaluate the impact of selective dropouts attributable to death or disease progression on the estimates of marker change among different groups. Methods. Data on CD4 cell count in human immunodeficiency virus 1-infected individuals from a clinical trial and a cohort study are used to illustrate this problem and a possible solution. Simulation studies are also presented. Results. When the rate of dropout is greater in subjects whose marker status is declining rapidly, commonly used methods, like random effects models, that ignore informative dropouts lead to overoptimistic statements about the marker trends in all compared groups, because subjects with steeper marker drops tend to have shorter follow-up times and hence are weighted less in the estimation of the group rate of the average marker decline. Conclusions. The potential biases attributable to incomplete data require greater recognition in longitudinal studies. Sensitivity analyses to assess the effect of dropouts are important.

[1]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[2]  M D Schluchter,et al.  Analysis of change in the presence of informative censoring: application to a longitudinal clinical trial of progressive renal disease , 2001, Statistics in medicine.

[3]  S J Pocock,et al.  Estimation and comparison of rates of change in longitudinal studies with informative drop-outs. , 1999, Statistics in medicine.

[4]  J. Goedert,et al.  Effects of age at seroconversion and baseline HIV RNA level on the loss of CD4+ cells among persons with hemophilia , 1998, AIDS.

[5]  A. Hatzakis,et al.  Determinants of progression of HIV infection in a Greek hemophilia cohort followed for up to 16 years after seroconversion. , 1998, Journal of acquired immune deficiency syndromes and human retrovirology : official publication of the International Retrovirology Association.

[6]  C. Rizza,et al.  Importance of age at infection with HIV-1 for survival and development of AIDS in UK haemophilia population , 1996, The Lancet.

[7]  J. Singer,et al.  Impact of missing data due to dropouts on estimates of the treatment effect in a randomized trial of antiretroviral therapy for HIV-infected individuals. Canadian HIV Trials Network A002 Study Group. , 1996, Journal of acquired immune deficiency syndromes and human retrovirology : official publication of the International Retrovirology Association.

[8]  The Alpha trial: European/Australian randomized double-blind trial of two doses of didanosine in zidovudine-intolerant patients with symptomatic HIV disease. Alpha International Coordinating Committee. , 1996, AIDS.

[9]  S C Choi,et al.  Effect of non-random missing data mechanisms in clinical trials. , 1995, Statistics in medicine.

[10]  P. Diggle Analysis of Longitudinal Data , 1995 .

[11]  S. Crawford,et al.  A comparison of anlaytic methods for non-random missingness of outcome data. , 1995, Journal of clinical epidemiology.

[12]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[13]  V. De Gruttola,et al.  Modelling progression of CD4-lymphocyte count and its relationship to survival time. , 1994, Biometrics.

[14]  Jeremy M. G. Taylor,et al.  A Stochastic Model for Analysis of Longitudinal AIDS Data , 1994 .

[15]  James J. Goedert,et al.  Effect of age at seroconversion on the natural AIDS incubation distribution , 1994, AIDS.

[16]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[17]  J J Goedert,et al.  HIV-1 infection incidence among persons with hemophilia in the United States and western Europe, 1978-1990. Multicenter Hemophilia Cohort Study. , 1994, Journal of acquired immune deficiency syndromes.

[18]  M. Kenward,et al.  Informative dropout in longitudinal data analysis (with discussion) , 1994 .

[19]  C. Sabin,et al.  Laboratory control values for CD4 and CD8 T lymphocytes. Implications for HIV‐1 diagnosis , 1992, Clinical and experimental immunology.

[20]  A. Gelfand,et al.  Hierarchical Bayes Models for the Progression of HIV Infection Using Longitudinal CD4 T-Cell Numbers , 1992 .

[21]  M D Schluchter,et al.  Methods for the analysis of informatively censored longitudinal data. , 1992, Statistics in medicine.

[22]  R F Woolson,et al.  Application of empirical Bayes inference to estimation of rate of change in the presence of informative right censoring. , 1992, Statistics in medicine.

[23]  S J Pocock,et al.  Repeated measures in clinical trials: analysis using mean summary statistics and its implications for design. , 1992, Statistics in medicine.

[24]  J Elford,et al.  More rapid progression to AIDS in older HIV-infected people: the role of CD4+ T-cell counts. , 1991, Journal of acquired immune deficiency syndromes.

[25]  S W Lagakos,et al.  Analyzing laboratory marker changes in AIDS clinical trials. , 1991, Journal of acquired immune deficiency syndromes.

[26]  S Senn,et al.  Analysis of serial measurements in medical research. , 1990, BMJ.

[27]  J. Goedert,et al.  A prospective study of human immunodeficiency virus type 1 infection and the development of AIDS in subjects with hemophilia. , 1989, The New England journal of medicine.

[28]  Joan S. Chmiel,et al.  Acquired immunodeficiency syndrome (AIDS)-free time after human immunodeficiency virus type 1 (HIV-1) seroconversion in homosexual men. Multicenter AIDS Cohort Study Group. , 1989, American journal of epidemiology.

[29]  J. Wittes,et al.  Surrogate endpoints in clinical trials: cardiovascular diseases. , 1989, Statistics in medicine.

[30]  Raymond J. Carroll,et al.  Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process , 1988 .

[31]  K. Bailey,et al.  Analysing changes in the presence of informative right censoring caused by death and withdrawal. , 1988, Statistics in medicine.

[32]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[33]  R. P. Falcão,et al.  Age-associated changes of T lymphocyte subsets. , 1987, Diagnostic and clinical immunology.

[34]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.