Measurement error correction using validation data: a review of methods and their applicability in case-control studies

Measurement error is a serious problem in the analysis of epidemiological data. In the past 20 years, a large number of methods for the correction of measurement error have been developed. While at the beginning mostly methods for cohort studies were considered, recently more attention has been paid to case-control studies. Although a variety of methods have been proposed, they are very rarely used in practice. To stimulate their use and further development, this article provides a comprehensive overview on methods developed for multivariable regression analysis of epidemiologic studies with validation data sets. The methods are systematically classified with respect to the underlying theory. An assessment of prerequisites, assumptions and performance of the available methods is given. Particular attention is paid to applicability to case-control studies and need for further research and development is pointed out.

[1]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[2]  P. Dellaportas,et al.  BAYESIAN ANALYSIS OF ERRORS-IN-VARIABLES REGRESSION MODELS , 1995 .

[3]  W. Willett,et al.  An overview of issues related to the correction of non-differential exposure measurement error in epidemiologic studies. , 1989, Statistics in medicine.

[4]  H Brenner Use and Limitations of Dual Measurements in Correcting for Nondifferential Exposure Misclassification , 1992, Epidemiology.

[5]  S Richardson,et al.  A Bayesian approach to measurement error problems in epidemiology using conditional independence models. , 1993, American journal of epidemiology.

[6]  Raymond J. Carroll,et al.  Semiparametric Estimation in Logistic Measurement Error Models , 1989 .

[7]  Raymond J. Carroll,et al.  Approximate Quasi-likelihood Estimation in Models with Surrogate Predictors , 1990 .

[8]  A. Walker,et al.  Misclassification of covariates. , 1991, Statistics in medicine.

[9]  C Brownie,et al.  The effects of exposure misclassification on estimates of relative risk. , 1986, American journal of epidemiology.

[10]  R. Gunst,et al.  Polynomial measurement error modeling , 1995 .

[11]  S Greenland,et al.  The effect of misclassification in the presence of covariates. , 1980, American journal of epidemiology.

[12]  Mitchell H. Gail,et al.  Case-Control Studies With Errors in Covariates , 1993 .

[13]  Hermann Brenner,et al.  Correcting for Exposure Misclassification Using an Alloyed Gold Standard , 1996, Epidemiology.

[14]  B Rosner,et al.  A Bayesian approach to logistic regression models having measurement error following a mixture distribution. , 1993, Statistics in medicine.

[15]  J R Marshall,et al.  The use of dual or multiple reports in epidemiologic studies. , 1989, Statistics in medicine.

[16]  S D Walter,et al.  Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review. , 1988, Journal of clinical epidemiology.

[17]  Karl-Heinz Jöckel,et al.  Logistic analysis in case-control studies under validation sampling , 1993 .

[18]  J. R. Cook,et al.  Simulation-Extrapolation: The Measurement Error Jackknife , 1995 .

[19]  T R Holford,et al.  Study design for epidemiologic studies with measurement error , 1995, Statistical methods in medical research.

[20]  S Wacholder,et al.  Validation studies using an alloyed gold standard. , 1993, American journal of epidemiology.

[21]  S. Richardson,et al.  Conditional independence models for epidemiological studies with covariate measurement error. , 1993, Statistics in medicine.

[22]  N E Breslow,et al.  Weighted likelihood, pseudo-likelihood and maximum likelihood methods for logistic regression analysis of two-stage data. , 1997, Statistics in medicine.

[23]  L Leblond,et al.  Some comments on misspecification of priors in Bayesian modelling of measurement error problems. , 1997, Statistics in medicine.

[24]  J. Robins,et al.  Analysis of case-control data derived in part from proxy respondents. , 1988, American journal of epidemiology.

[25]  Lung-fei Lee,et al.  Estimation of Linear and Nonlinear Errors-in-Variables Models Using Validation Data , 1995 .

[26]  L L Kupper,et al.  Effects of the use of unreliable surrogate variables on the validity of epidemiologic research studies. , 1984, American journal of epidemiology.

[27]  J Kaldor,et al.  Latent class analysis in chronic disease epidemiology. , 1985, Statistics in medicine.

[28]  S Senn Covariance analysis in generalized linear measurement error models. , 1990, Statistics in medicine.

[29]  Correction of risk estimates for measurement error in epidemiology. , 1995, Methods of information in medicine.

[30]  H Brenner,et al.  Bias due to non-differential misclassification of polytomous confounders. , 1993, Journal of clinical epidemiology.

[31]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. , 2006, Statistics in medicine.

[32]  M Dosemeci,et al.  Does nondifferential misclassification of exposure always bias a true effect toward the null value? , 1990, American journal of epidemiology.

[33]  Margaret S. Pepe,et al.  A mean score method for missing and auxiliary covariate data in regression models , 1995 .

[34]  J. Kuha,et al.  Corrections for exposure measurement error in logistic regression models with an application to nutritional data. , 1994, Statistics in medicine.

[35]  R J Carroll Covariance analysis in generalized linear measurement error models. , 1989, Statistics in medicine.

[36]  S W Duffy,et al.  External validation, repeat determination, and precision of risk estimation in misclassified exposure data in epidemiology. , 1992, Journal of epidemiology and community health.

[37]  James M. Robins,et al.  Semiparametric efficient estimation of a conditional density with missing or mismeasured covariates , 1995 .

[38]  D Spiegelman,et al.  Fully parametric and semi-parametric regression models for common events with covariate measurement error in main study/validation study designs. , 1997, Biometrics.

[39]  Alice S. Whittemore,et al.  Approximations for Regression with Covariate Measurement Error , 1988 .

[40]  J Kuha Estimation by data augmentation in regression models with continuous and discrete covariates measured with error. , 1997, Statistics in medicine.

[41]  D Spiegelman,et al.  Measurement error correction for logistic regression models with an "alloyed gold standard". , 1997, American journal of epidemiology.

[42]  Yosef Hochberg,et al.  On the Use of Double Sampling Schemes in Analyzing Categorical Data with Misclassification Errors , 1977 .

[43]  Mustafa Dosemeci,et al.  RE: “DOES NONDIFFERENTIAL MISCLASSIFICATION OF EXPOSURE ALWAYS BIAS A TRUE EFFECT TOWARD THE NULL VALUE?” , 1991 .

[44]  I. Bross Misclassification in 2 X 2 Tables , 1954 .

[45]  Raymond J. Carroll,et al.  Asymptotics for the SIMEX Estimator in Nonlinear Measurement Error Models , 1996 .

[46]  D. Savitz,et al.  The effects of joint misclassification of exposure and disease on epidemiologic measures of association. , 1993, Journal of clinical epidemiology.

[47]  K. Flegal,et al.  Differential misclassification arising from nondifferential errors in exposure measurement. , 1991, American journal of epidemiology.

[48]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[49]  Thomas R. Fleming,et al.  A Nonparametric Method for Dealing with Mismeasured Covariate Data , 1991 .

[50]  B. Gladen,et al.  Misclassification and the design of environmental studies. , 1979, American journal of epidemiology.

[51]  Judith D. Goldberg,et al.  The Effects of Misclassification on the Bias in the Difference Between Two Proportions and the Relative Odds in the Fourfold Table , 1975 .

[52]  Charles L. Odoroff,et al.  Log-Linear Models for Doubly Sampled Categorical Data Fitted by the EM Algorithm , 1985 .

[53]  J. R. Cook,et al.  Simulation-Extrapolation Estimation in Parametric Measurement Error Models , 1994 .

[54]  B G Armstrong,et al.  Analysis of case-control data with covariate measurement error: application to diet and colon cancer. , 1989, Statistics in medicine.

[55]  Kathryn Roeder,et al.  A Bayesian semiparametric model for case-control studies with errors in variables , 1997 .

[56]  L A Stefanski,et al.  A Measurement Error Model for Binary and Ordinal Regression Title: a Measurement Error Model for Binary and Ordinal Regression , 2022 .

[57]  L. Kupper,et al.  Inferences About Exposure-Disease Associations Using Probability-of-Exposure Information , 1993 .

[58]  Norman E. Breslow,et al.  Logistic regression for two-stage case-control data , 1988 .

[59]  T T Chen A review of methods for misclassified categorical data in epidemiology. , 1989, Statistics in medicine.

[60]  T. Chen,et al.  Log-Linear Models for Categorical Data with Misclassification and Double Sampling , 1979 .

[61]  R. Carroll,et al.  Prospective Analysis of Logistic Case-Control Studies , 1995 .

[62]  S L Hui,et al.  A general approach to analyzing epidemiologic data that contain misclassification errors. , 1987, Biometrics.

[63]  T. Kohlmann,et al.  Latent class analysis in medical research , 1996, Statistical methods in medical research.

[64]  D. Thomas,et al.  Exposure measurement error: influence on exposure-disease. Relationships and methods of correction. , 1993, Annual review of public health.

[65]  A comparative study of four methods for analysing repeated measures data. , 1996, Statistics in medicine.

[66]  H Checkoway,et al.  Bias due to misclassification in the estimation of relative risk. , 1977, American journal of epidemiology.

[67]  K. Roeder,et al.  A Semiparametric Mixture Approach to Case-Control Studies with Errors in Covariables , 1996 .

[68]  Raymond J. Carroll,et al.  A Semiparametric Correction for Attenuation , 1994 .

[69]  D. Spiegelhalter,et al.  Modelling Complexity: Applications of Gibbs Sampling in Medicine , 1993 .

[70]  Raymond J. Carroll,et al.  On errors-in-variables for binary regression models , 1984 .

[71]  R. Prentice,et al.  Further results on covariate measurement errors in cohort studies with time to response data. , 1989, Statistics in medicine.

[72]  J. Selen Adjusting for errors in classification and measurement in the analysis of partly and purely categorical data , 1986 .

[73]  J. Haukka,et al.  Correction for covariate measurement error in generalized linear models--a bootstrap approach. , 1995, Biometrics.

[74]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for measurement error: the case of multiple covariates measured with error. , 1990, American journal of epidemiology.

[75]  D Clayton,et al.  Using test-retest reliability data to improve estimates of relative risk: an application of latent class analysis. , 1985, Statistics in medicine.