Adjustment for non-differential misclassification error in the generalized linear model.

It is well known that estimates of association between an outcome variable and a set of categorical covariates, some of which are measured with misclassification, tend to be biased upon application of the usual methods of estimation that ignore the classification error. We propose a method to adjust for misclassification in covariates when one applies the generalized linear model. In the case where one can observe some true covariates only through surrogates, we combine a latent class analysis with the approach to incorporate multiple surrogates into the model. We include discussion on the efficacy of repeated measurements which one can view as a special case of multiple surrogates with identical distribution. We provide two examples to demonstrate the applicability of the method and the efficacy of multiple replicates for a covariate subject to misclassification in a regression framework.

[1]  W. Pan,et al.  Positive relationship between urinary sodium chloride and blood pressure in Chinese health examinees and its association with calcium excretion. , 1990, Journal of hypertension.

[2]  J S Uebersax,et al.  Latent class analysis of diagnostic agreement. , 1990, Statistics in medicine.

[3]  W. Willett,et al.  An overview of issues related to the correction of non-differential exposure measurement error in epidemiologic studies. , 1989, Statistics in medicine.

[4]  T T Chen A review of methods for misclassified categorical data in epidemiology. , 1989, Statistics in medicine.

[5]  W. Stewart,et al.  An epidemiologic study of headache among adolescents and young adults. , 1989, JAMA.

[6]  L. Blackwood,et al.  Latent variable models for the analysis of medical data with repeated measures of binary variables. , 1988, Statistics in medicine.

[7]  S Greenland,et al.  Variance estimation for epidemiologic effect estimates under misclassification. , 1988, Statistics in medicine.

[8]  S D Walter,et al.  Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review. , 1988, Journal of clinical epidemiology.

[9]  S L Hui,et al.  A general approach to analyzing epidemiologic data that contain misclassification errors. , 1987, Biometrics.

[10]  A. Ekholm,et al.  Exponential family non‐linear models for categorical data with errors of observation , 1987 .

[11]  D. Bartholomew Latent Variable Models And Factor Analysis , 1987 .

[12]  J. Selen Adjusting for errors in classification and measurement in the analysis of partly and purely categorical data , 1986 .

[13]  D Clayton,et al.  Using test-retest reliability data to improve estimates of relative risk: an application of latent class analysis. , 1985, Statistics in medicine.

[14]  Charles L. Odoroff,et al.  Log-Linear Models for Doubly Sampled Categorical Data Fitted by the EM Algorithm , 1985 .

[15]  J Kaldor,et al.  Latent class analysis in chronic disease epidemiology. , 1985, Statistics in medicine.

[16]  B. Muthén A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators , 1984 .

[17]  Gail Gong,et al.  Pseudo Maximum Likelihood Estimation: Theory and Applications , 1981 .

[18]  S Greenland,et al.  The effect of misclassification in the presence of covariates. , 1980, American journal of epidemiology.

[19]  T. Chen,et al.  Log-Linear Models for Categorical Data with Misclassification and Double Sampling , 1979 .

[20]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[21]  H K Ury,et al.  Efficiency of case-control studies with multiple controls per case: continuous or dichotomous data. , 1975, Biometrics.

[22]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[23]  L. A. Goodman Exploratory latent structure analysis using both identifiable and unidentifiable models , 1974 .

[24]  L. A. Goodman The Analysis of Systems of Qualitative Variables When Some of the Variables Are Unobservable. Part I-A Modified Latent Structure Approach , 1974, American Journal of Sociology.

[25]  A. Tenenbein A Double Sampling Scheme for Estimating from Binomial Data with Misclassifications , 1970 .

[26]  R. Anderson,et al.  AN INVESTIGATION OF THE EFFECT OF MISCLASSIFICATION ON THE PROPERTIES OF CHI-2-TESTS IN THE ANALYSIS OF CATEGORICAL DATA. , 1965, Biometrika.

[27]  I. Bross Misclassification in 2 X 2 Tables , 1954 .