Correlated measurement error--implications for nutritional epidemiology.

BACKGROUND In nutritional epidemiology, it is common to fit models in which several dietary variables are included. However, with standard instruments for dietary assessment, not only are the intakes of many nutrients often highly correlated, but the errors in the estimation of the intake of different nutrients are also correlated. The effect of this error correlation on the results of observational studies has been little investigated. This paper describes the effect on multivariate regression coefficients of different levels of correlation, both between the variables themselves and between the errors of estimation of these variables. METHODS Using a simple model for the multivariate error structure, we examine the effect on the estimates of bivariate linear regression coefficients of (1) differential precision of measurement of the two independent variables, (2) differing levels of correlation between the true values of the two variables, and (3) differing levels of correlation between the errors of measurement of the two variables. As an example, the prediction of plasma vitamin C levels by dietary intake variables is considered, using data from the European Prospective Investigation of Cancer (EPIC) Norfolk study in which dietary intake was estimated using both a food frequency questionnaire (FFQ) and a 7-day diary (7DD). The dietary variables considered are vitamin C, fat, and energy, with different approaches taken to energy adjustment. RESULTS When the error correlation is zero, the estimates of the bivariate regression coefficients reflect the precision of measurement of the two variables and mutual confounding. The sum of the observed regression coefficients is biased towards the null as in univariate regression. When the error correlation is non-zero but below about 0.7, the effect is minor. However, as the error correlation increases beyond 0.8 the effect becomes large and highly dependent on the relative precision with which the two variables are measured. At the extreme, the bivariate estimates can become indefinitely large. In the example, the error correlation between fat and energy using the FFQ appears to be over 0.9, the corresponding value for the 7DD being approximately 0.85. The error correlation between vitamin C and fat, and vitamin C and energy, appears to be below 0.5 and smaller for the 7DD than for the FFQ. The impact of these error correlations on bivariate regression coefficients is large. The effect of energy adjustment differs widely between vitamin C and fat. CONCLUSION High levels of error correlation can have a large effect on bivariate regression estimates, varying widely depending on which two variables are considered. In particular, the effect of energy adjustment will vary widely. For vitamin C, the effect of energy adjustment appears negligible, whereas for fat the effect is large indicating that error correlation close to one can partially remove regression dilution due to measurement error. If, for fat intake, energy adjustment is performed by using energy density, the partial removal of regression dilution is achieved at the expense of substantial reduction in the true variance.

[1]  N. Day,et al.  Adjusting for energy intake--what measure to use in nutritional epidemiological studies? , 2004, International journal of epidemiology.

[2]  N. Day,et al.  The effect of correlated measurement error in multivariate models of diet. , 2004, American journal of epidemiology.

[3]  E. Bertone-Johnson Nutrition and Lifestyle: Opportunities for Cancer Prevention , 2003 .

[4]  Raymond J Carroll,et al.  Structure of dietary measurement error: results of the OPEN biomarker study. , 2003, American journal of epidemiology.

[5]  D. Midthune,et al.  Comparative validation of the Block, Willett, and National Cancer Institute food frequency questionnaires : the Eating at America's Table Study. , 2001, American journal of epidemiology.

[6]  M. Wong,et al.  Epidemiological assessment of diet: a comparison of a 7-day diary with a food frequency questionnaire using urinary markers of nitrogen, potassium and sodium. , 2001, International journal of epidemiology.

[7]  W. Willett Commentary: Dietary diaries versus food frequency questionnaires-a case of undigestible data. , 2001, International journal of epidemiology.

[8]  M Y Wong,et al.  Measurement error in epidemiology: the design of validation studies II: bivariate situation. , 1999, Statistics in medicine.

[9]  N. Day,et al.  EPIC-Norfolk: study design and characteristics of the cohort. European Prospective Investigation of Cancer. , 1999, British journal of cancer.

[10]  S Wacholder,et al.  Effect of measurement error on energy-adjustment models in nutritional epidemiology. , 1997, American journal of epidemiology.

[11]  S. Bingham,et al.  7. Biochemical markers of nutrient intake , 1997 .

[12]  M. L. Nelson,et al.  Design concepts in nutritional epidemiology , 1997 .

[13]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for measurement error: the case of multiple covariates measured with error. , 1990, American journal of epidemiology.

[14]  N. Day,et al.  Some methodological issues in nutritional epidemiology. , 2002, IARC scientific publications.

[15]  N E Day,et al.  Validation of dietary assessment methods in the UK arm of EPIC using weighed records, and 24-hour urinary nitrogen and potassium and serum vitamin C and carotenoids as biomarkers. , 1997, International journal of epidemiology.