A NEW MULTIVARIATE MEASUREMENT ERROR MODEL WITH ZERO-INFLATED DIETARY DATA, AND ITS APPLICATION TO DIETARY ASSESSMENT.

In the United States the preferred method of obtaining dietary intake data is the 24-hour dietary recall, yet the measure of most interest is usual or long-term average daily intake, which is impossible to measure. Thus, usual dietary intake is assessed with considerable measurement error. Also, diet represents numerous foods, nutrients and other components, each of which have distinctive attributes. Sometimes, it is useful to examine intake of these components separately, but increasingly nutritionists are interested in exploring them collectively to capture overall dietary patterns. Consumption of these components varies widely: some are consumed daily by almost everyone on every day, while others are episodically consumed so that 24-hour recall data are zero-inflated. In addition, they are often correlated with each other. Finally, it is often preferable to analyze the amount of a dietary component relative to the amount of energy (calories) in a diet because dietary recommendations often vary with energy level. The quest to understand overall dietary patterns of usual intake has to this point reached a standstill. There are no statistical methods or models available to model such complex multivariate data with its measurement error and zero inflation. This paper proposes the first such model, and it proposes the first workable solution to fit such a model. After describing the model, we use survey-weighted MCMC computations to fit the model, with uncertainty estimation coming from balanced repeated replication.The methodology is illustrated through an application to estimating the population distribution of the Healthy Eating Index-2005 (HEI-2005), a multi-component dietary quality index involving ratios of interrelated dietary components to energy, among children aged 2-8 in the United States. We pose a number of interesting questions about the HEI-2005 and provide answers that were not previously within the realm of possibility, and we indicate ways that our approach can be used to answer other questions of importance to nutritional science and public health.

[1]  Alexander Meister,et al.  Density estimation with heteroscedastic error , 2008, 0805.2216.

[2]  R. Prentice Dietary assessment and the reliability of nutritional epidemiology reports , 2003, The Lancet.

[3]  Aurore Delaigle,et al.  Using SIMEX for Smoothing-Parameter Choice in Errors-in-Variables Problems , 2008 .

[4]  Emmanuel Lesaffre,et al.  A General Method for Dealing with Misclassification in Regression: The Misclassification SIMEX , 2006, Biometrics.

[5]  R. Prentice,et al.  Measurement error and results from analytic epidemiology: dietary fat and breast cancer. , 1996, Journal of the National Cancer Institute.

[6]  A. Carriquiry Estimation of usual intake distributions of nutrients and foods. , 2003, The Journal of nutrition.

[7]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[8]  Alexander Kukush,et al.  Measurement Error Models , 2011, International Encyclopedia of Statistical Science.

[9]  Jill Reedy,et al.  Evaluation of the Healthy Eating Index-2005. , 2008, Journal of the American Dietetic Association.

[10]  Raymond J Carroll,et al.  Modeling Data with Excess Zeros and Measurement Error: Application to Evaluating Relationships between Episodically Consumed Foods and Health Outcomes , 2009, Biometrics.

[11]  Gary K Grunwald,et al.  Analysis of repeated measures data with clumping at zero , 2002, Statistical methods in medical research.

[12]  John P. Buonaccorsi,et al.  Measurement Error: Models, Methods, and Applications , 2010 .

[13]  P. Hall,et al.  ESTIMATION OF OBSERVATION-ERROR VARIANCE IN ERRORS-IN-VARIABLES REGRESSION , 2011 .

[14]  D. Wagstaff,et al.  Fitting a linear model to survey data when the long-term average daily intake of a dietary component is an explanatory variable , 2009 .

[15]  S. Kirkpatrick,et al.  Development of the Healthy Eating Index‐2010 , 2008, Journal of the American Dietetic Association.

[16]  D. Ruppert,et al.  Density Estimation in the Presence of Heteroscedastic Measurement Error , 2008 .

[17]  X. R. L N K C Z V E A Y H G U T X R L N K C Z V E A Y H [General method]. , 2000, Diabetes & metabolism.

[18]  A. Carriquiry,et al.  A Semiparametric Transformation Approach to Estimating Usual Daily Intake Distributions , 1996 .

[19]  F. Clavel-Chapelon,et al.  A bivariate measurement error model for nitrogen and potassium intakes to evaluate the performance of regression calibration in the European Prospective Investigation into Cancer and Nutrition study , 2009, European Journal of Clinical Nutrition.

[20]  Murali Haran,et al.  Markov chain Monte Carlo: Can we trust the third significant figure? , 2007, math/0703746.

[21]  J. Staudenmayer,et al.  Density Estimation in the Presence of Heteroskedastic Measurement Error , 2007 .

[22]  Richard H. Jones Analysis of repeated measures , 1992 .

[23]  D. Midthune,et al.  A population's distribution of Healthy Eating Index-2005 component scores can be estimated when more than one 24-hour recall is available. , 2010, The Journal of nutrition.

[24]  L. Natarajan Regression Calibration for Dichotomized Mismeasured Predictors , 2009, The international journal of biostatistics.

[25]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[26]  A Guolo,et al.  A Flexible Approach to Measurement Error Correction in Case–Control Studies , 2008, Biometrics.

[27]  Matt P. Wand,et al.  Finite sample performance of deconvolving density estimators , 1998 .

[28]  Alexander Meister,et al.  On deconvolution with repeated measurements , 2008 .

[29]  David Ruppert,et al.  Additive Partial Linear Models with Measurement Errors. , 2008, Biometrika.

[30]  Gary E Fraser,et al.  Correlations between estimated and true dietary intakes. , 2004, Annals of epidemiology.

[31]  Aurore Delaigle,et al.  An alternative view of the deconvolution problem , 2008 .

[32]  A. Carriquiry Assessing the prevalence of nutrient inadequacy , 1999, Public Health Nutrition.

[33]  R. Carroll,et al.  The International Journal of Biostatistics Fitting a Bivariate Measurement Error Model for Episodically Consumed Dietary Components , 2011 .

[34]  R. Carroll,et al.  A new statistical method for estimating the usual intake of episodically consumed foods with application to their distribution. , 2006, Journal of the American Dietetic Association.

[35]  Loki Natarajan,et al.  Maximum likelihood, multiple imputation and regression calibration for measurement error adjustment , 2008, Statistics in medicine.

[36]  Wayne A. Fuller,et al.  Estimating Usual Dietary Intake Distributions: Adjusting for Measurement Error and Nonnormality in 24-Hour Food Intake Data , 1997 .

[37]  C. Robert Simulation of truncated normal variables , 2009, 0907.4010.

[38]  Lynne Stokes Introduction to Variance Estimation , 2008 .

[39]  P. Levy Measurement Error and Misclassification in Statistics and Epidemiology: Impacts and Bayesian Adjustments , 2004 .