A toolkit for measurement error correction, with a focus on nutritional epidemiology

Exposure measurement error is a problem in many epidemiological studies, including those using biomarkers and measures of dietary intake. Measurement error typically results in biased estimates of exposure-disease associations, the severity and nature of the bias depending on the form of the error. To correct for the effects of measurement error, information additional to the main study data is required. Ideally, this is a validation sample in which the true exposure is observed. However, in many situations, it is not feasible to observe the true exposure, but there may be available one or more repeated exposure measurements, for example, blood pressure or dietary intake recorded at two time points. The aim of this paper is to provide a toolkit for measurement error correction using repeated measurements. We bring together methods covering classical measurement error and several departures from classical error: systematic, heteroscedastic and differential error. The correction methods considered are regression calibration, which is already widely used in the classical error setting, and moment reconstruction and multiple imputation, which are newer approaches with the ability to handle differential error. We emphasize practical application of the methods in nutritional epidemiology and other fields. We primarily consider continuous exposures in the exposure-outcome model, but we also outline methods for use when continuous exposures are categorized. The methods are illustrated using the data from a study of the association between fibre intake and colorectal cancer, where fibre intake is measured using a diet diary and repeated measures are available for a subset. © 2014 The Authors.

[1]  D. Ruppert,et al.  Nonparametric regression in the presence of measurement error , 1999 .

[2]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. , 2006, Statistics in medicine.

[3]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[4]  P. F. Kauff Group , 2000, Elegant Design.

[5]  A F Subar,et al.  Design and serendipity in establishing a large cohort with wide dietary intake distributions : the National Institutes of Health-American Association of Retired Persons Diet and Health Study. , 2001, American journal of epidemiology.

[6]  Raymond J Carroll,et al.  A comparison of a food frequency questionnaire with a 24-hour recall for use in an epidemiological cohort study: results from the biomarker-based Observing Protein and Energy Nutrition (OPEN) study. , 2003, International journal of epidemiology.

[7]  D. Spiegelman,et al.  Regression Calibration with Heteroscedastic Error Variance , 2011, The international journal of biostatistics.

[8]  R. Collins,et al.  Blood pressure, stroke, and coronary heart disease Part 1, prolonged differences in blood pressure: prospective observational studies corrected for the regression dilution bias , 1990, The Lancet.

[9]  J. Kuha,et al.  Corrections for exposure measurement error in logistic regression models with an application to nutritional data. , 1994, Statistics in medicine.

[10]  D. Hall Measurement Error in Nonlinear Models: A Modern Perspective , 2008 .

[11]  Marie Davidian,et al.  A Moment‐Adjusted Imputation Method for Measurement Error Models , 2011, Biometrics.

[12]  R. Little,et al.  Regression analysis with covariates that have heteroscedastic measurement error , 2011, Statistics in medicine.

[13]  Raymond J Carroll,et al.  Bias in dietary-report instruments and its implications for nutritional epidemiology , 2002, Public Health Nutrition.

[14]  S A Bingham,et al.  Urine nitrogen as an independent validatory measure of dietary intake: a study of nitrogen balance in individuals consuming their normal diet. , 1985, The American journal of clinical nutrition.

[15]  I. White,et al.  Alcohol intake and risk of colorectal cancer: Results from the UK Dietary Cohort Consortium , 2010, British Journal of Cancer.

[16]  S. Greenland Dose‐Response and Trend Analysis in Epidemiology: Alternatives to Categorical Analysis , 1995, Epidemiology.

[17]  Jeremiah Stamler,et al.  Intersalt: an international study of electrolyte excretion and blood pressure. Results for 24 hour urinary sodium and potassium excretion. Intersalt Cooperative Research Group. , 1988 .

[18]  N. Day,et al.  Are imprecise methods obscuring a relation between fat and breast cancer? , 2003, The Lancet.

[19]  T. Key,et al.  Vitamins, minerals, essential fatty acids and colorectal cancer risk in the United Kingdom Dietary Cohort Consortium , 2012, International journal of cancer.

[20]  I. White,et al.  Dietary fat and breast cancer: comparison of results from food diaries and food-frequency questionnaires in the UK Dietary Cohort Consortium. , 2011, The American journal of clinical nutrition.

[21]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[22]  Raymond J Carroll,et al.  Structure of dietary measurement error: results of the OPEN biomarker study. , 2003, American journal of epidemiology.

[23]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[24]  J. R. Cook,et al.  Simulation-Extrapolation Estimation in Parametric Measurement Error Models , 1994 .

[25]  David Ruppert,et al.  Local polynomial regression and simulation–extrapolation , 2004 .

[26]  R. Carroll,et al.  A comparison of two dietary instruments for evaluating the fat-breast cancer relationship. , 2006, International journal of epidemiology.

[27]  Ruth H. Keogh,et al.  Estimating the alcohol–breast cancer association: a comparison of diet diaries, FFQs and combined measurements , 2012, European Journal of Epidemiology.

[28]  Ruth H. Keogh,et al.  Allowing for never and episodic consumers when correcting for error in food record measurements of dietary intake , 2011, Biostatistics.

[29]  I. White,et al.  Dietary fiber and colorectal cancer risk: a nested case-control study using food diaries. , 2010, Journal of the National Cancer Institute.

[30]  M. Hughes,et al.  Regression dilution in the proportional hazards model. , 1993, Biometrics.

[31]  J. Danesh,et al.  Regression dilution methods for meta-analysis: assessing long-term variability in plasma fibrinogen among 27,247 adults in 15 prospective studies. , 2006, International journal of epidemiology.

[32]  John B. Carlin,et al.  Bias and efficiency of multiple imputation compared with complete‐case analysis for missing covariate values , 2010, Statistics in medicine.

[33]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for random within-person measurement error. , 1992, American journal of epidemiology.

[34]  Patrick Royston,et al.  Multiple imputation using chained equations: Issues and guidance for practice , 2011, Statistics in medicine.

[35]  I. White,et al.  Correcting for Bias due to Misclassification when Error-prone Continuous Exposures Are Misclassified , 2012 .

[36]  Raymond J Carroll,et al.  A comparison of regression calibration, moment reconstruction and imputation for adjusting for covariate measurement error in regression , 2008, Statistics in medicine.

[37]  I. White,et al.  Intake of dietary fats and colorectal cancer risk: prospective findings from the UK Dietary Cohort Consortium. , 2010, Cancer epidemiology.

[38]  Ruth H. Keogh,et al.  Meat, poultry and fish and risk of colorectal cancer: pooled analysis of data from the UK dietary cohort consortium , 2010, Cancer Causes & Control.

[39]  Ruth H. Keogh,et al.  Effects of Classical Exposure Measurement Error on the Shape of Exposure-Disease Associations , 2012 .

[40]  M. Singer,et al.  Nutritional Epidemiology , 2020, Definitions.

[41]  Raymond J Carroll,et al.  Modeling Data with Excess Zeros and Measurement Error: Application to Evaluating Relationships between Episodically Consumed Foods and Health Outcomes , 2009, Biometrics.

[42]  Lena Osterhagen,et al.  Multiple Imputation For Nonresponse In Surveys , 2016 .

[43]  Patrick Royston,et al.  Multiple Imputation by Chained Equations (MICE): Implementation in Stata , 2011 .

[44]  D A Schoeller,et al.  Measurement of energy expenditure in free-living humans by using doubly labeled water. , 1988, The Journal of nutrition.

[45]  D Spiegelman,et al.  Correlated errors in biased surrogates: study designs and methods for measurement error correction , 2005, Statistics in medicine.

[46]  E. Riboli,et al.  Nutrition and cancer: background and rationale of the European Prospective Investigation into Cancer and Nutrition (EPIC). , 1992, Annals of oncology : official journal of the European Society for Medical Oncology.

[47]  B Rosner,et al.  Regression calibration method for correcting measurement-error bias in nutritional epidemiology. , 1997, The American journal of clinical nutrition.

[48]  K. Flegal,et al.  Differential misclassification arising from nondifferential errors in exposure measurement. , 1991, American journal of epidemiology.

[49]  M. Wong,et al.  Epidemiological assessment of diet: a comparison of a 7-day diary with a food frequency questionnaire using urinary markers of nitrogen, potassium and sodium. , 2001, International journal of epidemiology.

[50]  Raymond J Carroll,et al.  A New Method for Dealing with Measurement Error in Explanatory Variables of Regression Models , 2004, Biometrics.

[51]  G. Scally Intersalt: an international study of electrolyte excretion and blood pressure. Results for 24 hour urinary sodium and potassium excretion. Intersalt Cooperative Research Group. , 1988, BMJ.

[52]  H. Boshuizen,et al.  Multiple imputation of missing blood pressure covariates in survival analysis. , 1999, Statistics in medicine.

[53]  R. Keogh,et al.  Vitamin C intake from diary recordings and risk of breast cancer in the UK Dietary Cohort Consortium , 2012, European Journal of Clinical Nutrition.

[54]  S C Darby,et al.  Some aspects of measurement error in explanatory variables for continuous and binary regression models. , 1998, Statistics in medicine.

[55]  R. Carroll,et al.  Efficient regression calibration for logistic regression in main study/internal validation study designs with an imperfect reference instrument. , 2001, Statistics in medicine.

[56]  R J Carroll,et al.  Empirical evidence of correlated biases in dietary assessment instruments and its implications. , 2001, American journal of epidemiology.

[57]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for measurement error: the case of multiple covariates measured with error. , 1990, American journal of epidemiology.

[58]  Salomaa Regression dilution methods for meta-analysis: assessing long-term variability in plasma fibrinogen among 27 247 adults in 15 prospective studies , 2006 .

[59]  Sander Greenland,et al.  Multiple-imputation for measurement-error correction. , 2006, International journal of epidemiology.