The correction of risk estimates for measurement error.

PURPOSE The methods available for the correction of risk estimates for measurement errors are reviewed. The assumptions and design implications of each of the following six methods are noted: linear imputation, absolute limits, maximum likelihood, latent class, discriminant analysis and Gibbs sampling. METHODS All methods, with the exception of the absolute limits approach, require either repeated determinations on the same subjects with use of the methods that are prone to error or a validation study, in which the measurement is performed for a number of persons with use of both the error-prone method and a more accurate method regarded as a "gold standard". RESULTS The maximum likelihood, latent class and absolute limits methods are most suitable for purely discrete risk factors. The linear imputation methods and the closely related discrimination analysis method are suitable for continuous risk factors which, together with the errors of measurement, are usually assumed to be normally distributed. CONCLUSIONS The Gibbs sampling approach is, in principle, useful for both discrete and continuous risk factors and measurement errors, although its use does mandate that the user specify models and dependencies that may be very complex. Also, the Bayesian approach implicit in the use of Gibbs sampling is difficult to apply to the design of the case-control study.

[1]  N. Day,et al.  Diet, alcohol consumption and reproductive factors in a case‐control study of breast cancer in Moscow , 1991, International journal of cancer.

[2]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. , 2006, Statistics in medicine.

[3]  J. Selen Adjusting for errors in classification and measurement in the analysis of partly and purely categorical data , 1986 .

[4]  G. Casella,et al.  Explaining the Gibbs Sampler , 1992 .

[5]  S. Richardson,et al.  Conditional independence models for epidemiological studies with covariate measurement error. , 1993, Statistics in medicine.

[6]  S Greenland On correcting for misclassification in twin studies and other matched-pair studies. , 1989, Statistics in medicine.

[7]  D Clayton,et al.  Using test-retest reliability data to improve estimates of relative risk: an application of latent class analysis. , 1985, Statistics in medicine.

[8]  Correction of risk estimates for measurement error in epidemiology. , 1995, Methods of information in medicine.

[9]  A T Marinos,et al.  Experimental quantiles of epidemiological indices in case-control studies with non-differential misclassification. , 1995, Statistics in medicine.

[10]  William Mendenhall,et al.  Introduction to Probability and Statistics , 1961, The Mathematical Gazette.

[11]  Walter R. Gilks,et al.  A Language and Program for Complex Bayesian Modelling , 1994 .

[12]  R. Collins,et al.  Blood pressure, stroke, and coronary heart disease Part 2, short-term reductions in blood pressure: overview of randomised drug trials in their epidemiological context , 1990, The Lancet.

[13]  R. Collins,et al.  Blood pressure, stroke, and coronary heart disease Part 1, prolonged differences in blood pressure: prospective observational studies corrected for the regression dilution bias , 1990, The Lancet.

[14]  S Greenland,et al.  The effect of misclassification in matched-pair case-control studies. , 1982, American journal of epidemiology.

[15]  S Greenland,et al.  The effect of misclassification in the presence of covariates. , 1980, American journal of epidemiology.

[16]  A. Ekholm,et al.  Exponential family non‐linear models for categorical data with errors of observation , 1987 .

[17]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for random within-person measurement error. , 1992, American journal of epidemiology.

[18]  S. Greenland,et al.  Correcting for misclassification in two-way tables and matched-pair studies. , 1983, International journal of epidemiology.

[19]  S. Duffy,et al.  Risk factors for breast cancer: relevance to screening. , 1983, Journal of epidemiology and community health.

[20]  K. Liang,et al.  Adjustment for non-differential misclassification error in the generalized linear model. , 1991, Statistics in medicine.

[21]  A. Walker,et al.  Misclassification of covariates. , 1991, Statistics in medicine.

[22]  B G Armstrong,et al.  Analysis of case-control data with covariate measurement error: application to diet and colon cancer. , 1989, Statistics in medicine.

[23]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[24]  S. Duffy,et al.  Correcting for the effect of misclassification bias in a case-control study using data from two different questionnaires. , 1983, Biometrics.

[25]  M Blettner,et al.  What Does an Observed Relative Risk Convey about Possible Misclassification? , 1984, Methods of Information in Medicine.

[26]  N. Day,et al.  Misclassification in more than one factor in a case-control study: a combination of Mantel-Haenszel and maximum likelihood approaches. , 1989, Statistics in medicine.

[27]  S Richardson,et al.  A Bayesian approach to measurement error problems in epidemiology using conditional independence models. , 1993, American journal of epidemiology.

[28]  J Kaldor,et al.  Latent class analysis in chronic disease epidemiology. , 1985, Statistics in medicine.

[29]  G R Howe,et al.  A case‐control study of diet and colo‐rectal cancer , 1980, International journal of cancer.

[30]  S. Duffy,et al.  Repeat measurement of case-control data: correcting risk estimates for misclassification due to regression dilution of lipids in transient ischemic attacks and minor ischemic strokes. , 1991, American journal of epidemiology.

[31]  S W Duffy,et al.  External validation, repeat determination, and precision of risk estimation in misclassified exposure data in epidemiology. , 1992, Journal of epidemiology and community health.

[32]  W. Haenszel,et al.  Statistical aspects of the analysis of data from retrospective studies of disease. , 1959, Journal of the National Cancer Institute.

[33]  D.Sc. Joseph Berkson Are there Two Regressions , 1950 .