A generalized estimating equation approach to modelling incompatible data formats with covariate measurement error: application to human immunodeficiency virus immune markers

The integration of technological advances into research studies often raises an issue of incompatibility of data. This problem is common to longitudinal and multicentre studies, taking the form of changes in the definitions, acquisition of data or measuring instruments of some study variables. In our case of studying the relationship between a marker of immune response to human immunodeficiency virus and human immunodeficiency virus infection status, using data from the Multi-Center AIDS Cohort Study, changes in the manufactured tests used for both variables occurred throughout the study, resulting in data with different manufactured scales. In addition, the latent nature of the immune response of interest necessitated a further consideration of a measurement error component. We address the general issue of incompatibility of data, together with the issue of covariate measurement error, in a unified, generalized linear model setting with inferences based on the generalized estimating equation framework. General conditions are constructed to ensure consistent estimates and their variances for the primary model of interest, with the asymptotic behaviour of resulting estimates examined under a variety of modelling scenarios. The approach is illustrated by modelling a repeated ordinal response with incompatible formats, as a function of a covariate with incompatible formats and measurement error, based on the Multi-Center AIDS Cohort Study data.

[1]  S. Lipsitz,et al.  Analysis of repeated categorical data using generalized estimating equations. , 1994, Statistics in medicine.

[2]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[3]  A. L. Rae,et al.  The analysis of binomial data by a generalized linear mixed model , 1985 .

[4]  N. Breslow,et al.  Bias Correction in Generalized Linear Mixed Models with Multiple Components of Dispersion , 1996 .

[5]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[6]  J. Phair,et al.  The Multicenter AIDS Cohort Study: rationale, organization, and selected characteristics of the participants. , 1987, American journal of epidemiology.

[7]  Russell D. Wolfinger,et al.  Laplace's approximation for nonlinear mixed models. , 1993 .

[8]  P. McCullagh,et al.  Generalized Linear Models, 2nd Edn. , 1990 .

[9]  Alice S. Whittemore,et al.  Errors-in-Variables Regression Using Stein Estimates , 1989 .

[10]  R. Prentice Surrogate endpoints in clinical trials: definition and operational criteria. , 1989, Statistics in medicine.

[11]  Steven M. Wolinsky,et al.  Human immunodeficiency virus type 1 infection in homosexual men who remain seronegative for prolonged periods. , 1989, The New England journal of medicine.

[12]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[13]  B. Mach,et al.  IDENTIFICATION OF HIV-INFECTED SERONEGATIVE INDIVIDUALS BY A DIRECT DIAGNOSTIC TEST BASED ON HYBRIDISATION TO AMPLIFIED VIRAL DNA , 1988, The Lancet.

[14]  Kenneth A. Bollen,et al.  Structural Equations with Latent Variables , 1989 .

[15]  K Y Liang,et al.  Efficacy of repeated measures in regression models with measurement error. , 1992, Biometrics.

[16]  Doris d'Cruz-Grote,et al.  Prevention of HIV infection in developing countries , 1996, The Lancet.

[17]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[18]  P. R. Fisk,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1971 .

[19]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[20]  W. Haseltine,et al.  Replication and pathogenesis of the AIDS virus. , 1988, Journal of acquired immune deficiency syndromes.

[21]  Anthony S. Fauci,et al.  Host factors and the pathogenesis of HIV-induced disease , 1996, Nature.

[22]  Raymond J. Carroll,et al.  Bias Analysis and SIMEX Approach in Generalized Linear Mixed Measurement Error Models , 1998 .

[23]  Victor DeGruttola,et al.  Modeling The Relationship Between Progression Of CD4-Lymphocyte Count And Survival Time , 1992 .

[24]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[25]  A. Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[26]  Dani P. Bolognesi Do antibodies enhance the infection of cells by HIV? , 1989, Nature.

[27]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[28]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.