A joint back calculation model for the imputation of the date of HIV infection in a prevalent cohort

In studies of the natural history of HIV-1 infection, the time scale of primary interest is the time since infection. Unfortunately, this time is very often unknown for HIV infection and using the follow-up time instead of the time since infection is likely to provide biased results because of onset confounding. Laboratory markers such as the CD4 T-cell count carry important information concerning disease progression and can be used to predict the unknown date of infection. Previous work on this topic has made use of only one CD4 measurement or based the imputation on incident patients only. However, because of considerable intrinsic variability in CD4 levels and because incident cases are different from prevalent cases, back calculation based on only one CD4 determination per person or on characteristics of the incident sub-cohort may provide unreliable results. Therefore, we propose a methodology based on the repeated individual CD4 T-cells marker measurements that use both incident and prevalent cases to impute the unknown date of infection. Our approach uses joint modelling of the time since infection, the CD4 time path and the drop-out process. This methodology has been applied to estimate the CD4 slope and impute the unknown date of infection in HIV patients from the Swiss HIV Cohort Study. A procedure based on the comparison of different slope estimates is proposed to assess the goodness of fit of the imputation. Results of simulation studies indicated that the imputation procedure worked well, despite the intrinsic high volatility of the CD4 marker.

[1]  Gang Li,et al.  A Bayesian approach to joint analysis of longitudinal measurements and competing risks failure time data , 2007, Statistics in medicine.

[2]  R. Brookmeyer,et al.  The prevalent cohort study and the acquired immunodeficiency syndrome. , 1987, American journal of epidemiology.

[3]  H. Ullum,et al.  Effects of CCR5- 32, CCR2-64I, and SDF-1 3A Alleles on HIV-1 Disease Progression: An International Meta-Analysis of Individual-Patient Data , 2001, Annals of Internal Medicine.

[4]  R Henderson,et al.  Joint modelling of longitudinal measurements and event time data. , 2000, Biostatistics.

[5]  J J Goedert,et al.  Censoring in an epidemic with an application to hemophilia-associated AIDS. , 1989, Biometrics.

[6]  Jerry A. Hausman,et al.  Panel Data and Unobservable Individual Effects , 1981 .

[7]  V. De Gruttola,et al.  Modelling progression of CD4-lymphocyte count and its relationship to survival time. , 1994, Biometrics.

[8]  U. Dafni,et al.  Modeling the Progression of HIV Infection , 1991 .

[9]  I. Marschner,et al.  Using time of first positive HIV test and other auxiliary data in back-projection of AIDS incidence. , 1994, Statistics in medicine.

[10]  P. Bacchetti Estimating the Incubation Period of AIDS by Comparing Population Infection and Diagnosis Patterns , 1990 .

[11]  R B Geskus,et al.  Methods for estimating the AIDS incubation time distribution when date of seroconversion is censored , 2001, Statistics in medicine.

[12]  W J Boscardin,et al.  Longitudinal models for AIDS marker data , 1998, Statistical methods in medical research.

[13]  M. Wulfsohn,et al.  Modeling the Relationship of Survival to Longitudinal Data Measured with Error. Applications to Survival and CD4 Counts in Patients with AIDS , 1995 .

[14]  D. Vlahov,et al.  Long-term perspective on the prevalent-cohort biases in studies of human immunodeficiency virus progression. , 1997, American journal of epidemiology.

[15]  Susan R. Wilson,et al.  Bayesian projection of the acquired immune deficiency syndrome epidemic , 2002 .

[16]  Yudi Pawitan,et al.  Modeling Disease Marker Processes in AIDS , 1993 .

[17]  M D Schluchter,et al.  Methods for the analysis of informatively censored longitudinal data. , 1992, Statistics in medicine.

[18]  Mitchell H. Gail,et al.  AIDS Epidemiology: A Quantitative Approach , 1994 .

[19]  A. Phillips,et al.  The practical significance of potential biases in estimates of the AIDS incubation period distribution in the UK register of HIV seroconverters. , 1999, AIDS.

[20]  S. Berman A stochastic model for the distribution of HIV latency time based on T4 counts , 1990 .

[21]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[22]  R Brookmeyer,et al.  Biases in prevalent cohorts. , 1987, Biometrics.

[23]  D. Bates,et al.  Nonlinear mixed effects models for repeated measures data. , 1990, Biometrics.

[24]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[25]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[26]  S J Pocock,et al.  Estimation and comparison of rates of change in longitudinal studies with informative drop-outs. , 1999, Statistics in medicine.

[27]  J. Chmiel,et al.  Estimation of time since exposure for a prevalent cohort. , 1992, Statistics in medicine.

[28]  V. Beral,et al.  Effect of ignoring the time of HIV seroconversion in estimating changes in survival over calendar time in observational studies: results from CASCADE* , 2000, AIDS.

[29]  N G Becker,et al.  Estimating HIV incidence using dates of both HIV and AIDS diagnoses. , 2000, Statistics in medicine.

[30]  Edward W. Frees,et al.  Longitudinal and Panel Data , 2004 .

[31]  Mark D Schluchter,et al.  Shared parameter models for the joint analysis of longitudinal data and event times , 2006, Statistics in medicine.

[32]  B. Tindall,et al.  Estimation of time since infection using longitudinal disease-marker data. , 1994, Statistics in medicine.

[33]  J. Ward,et al.  Statistical analysis of the stages of HIV infection using a Markov model. , 1989, Statistics in medicine.

[34]  R. Elashoff,et al.  An approach to joint analysis of longitudinal measurements and competing risks failure time data , 2007, Statistics in medicine.

[35]  Jeremy M. G. Taylor,et al.  A Stochastic Model for Analysis of Longitudinal AIDS Data , 1994 .

[36]  Jerry A. Hausman,et al.  Panel Data and Unobservable Individual Effects , 1981 .

[37]  N M Laird,et al.  Model-based approaches to analysing incomplete longitudinal and failure time data. , 1997, Statistics in medicine.

[38]  Mitchell H. Gail,et al.  A Method for Obtaining Short-Term Projections and Lower Bounds on the Size of the AIDS Epidemic , 1988 .

[39]  James M. Robins,et al.  Semiparametric Regression for Repeated Outcomes With Nonignorable Nonresponse , 1998 .