Missing data in FFQs: making assumptions about item non-response

OBJECTIVE FFQs are a popular method of capturing dietary information in epidemiological studies and may be used to derive dietary exposures such as nutrient intake or overall dietary patterns and diet quality. As FFQs can involve large numbers of questions, participants may fail to respond to all questions, leaving researchers to decide how to deal with missing data when deriving intake measures. The aim of the present commentary is to discuss the current practice for dealing with item non-response in FFQs and to propose a research agenda for reporting and handling missing data in FFQs. RESULTS Single imputation techniques, such as zero imputation (assuming no consumption of the item) or mean imputation, are commonly used to deal with item non-response in FFQs. However, single imputation methods make strong assumptions about the missing data mechanism and do not reflect the uncertainty created by the missing data. This can lead to incorrect inference about associations between diet and health outcomes. Although the use of multiple imputation methods in epidemiology has increased, these have seldom been used in the field of nutritional epidemiology to address missing data in FFQs. We discuss methods for dealing with item non-response in FFQs, highlighting the assumptions made under each approach. CONCLUSIONS Researchers analysing FFQs should ensure that missing data are handled appropriately and clearly report how missing data were treated in analyses. Simulation studies are required to enable systematic evaluation of the utility of various methods for handling item non-response in FFQs under different assumptions about the missing data mechanism.

[1]  Carol J Boushey,et al.  The Dietary Patterns Methods Project: synthesis of findings across cohorts and relevance to dietary guidance. , 2015, The Journal of nutrition.

[2]  Victor Kipnis,et al.  Dealing with dietary measurement error in nutritional cohort studies. , 2011, Journal of the National Cancer Institute.

[3]  M. Kenward,et al.  Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.

[4]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[5]  V. Burley,et al.  Development, validation and utilisation of food-frequency questionnaires – a review , 2002, Public Health Nutrition.

[6]  J. Schafer,et al.  A comparison of inclusive and restrictive strategies in modern missing data procedures. , 2001, Psychological methods.

[7]  Stephen R Cole,et al.  Use of multiple imputation in the epidemiologic literature. , 2008, American journal of epidemiology.

[8]  E. Forsum,et al.  Strengthening the Reporting of Observational Studies in Epidemiology – nutritional epidemiology (STROBE‐nut): An extension of the STROBE statement† , 2016, Nutrition bulletin.

[9]  W. L. Beeson,et al.  Missing Data in a Long Food Frequency Questionnaire: Are Imputed Zeroes Correct? , 2009, Epidemiology.

[10]  P. Laake,et al.  Comparing methods for handling missing values in food-frequency questionnaires and proposing k nearest neighbours imputation: effects on dietary intake in the Norwegian Women and Cancer study (NOWAC) , 2008, Public Health Nutrition.

[11]  Rosa Abellana Sangra,et al.  The identification, impact and management of missing values and outlier data in nutritional epidemiology , 2015 .

[12]  K. Ball,et al.  Three-year change in diet quality and associated changes in BMI among schoolchildren living in socio-economically disadvantaged neighbourhoods , 2014, British Journal of Nutrition.

[13]  P. Wallström,et al.  What do review papers conclude about food and dietary patterns? , 2013, Food & nutrition research.

[14]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[15]  D. Midthune,et al.  Social desirability trait influences on self-reported dietary measures among diverse participants in a multicenter multiple risk factor trial. , 2008, The Journal of nutrition.

[16]  S. Tretli,et al.  Dietary fat and the risk of breast cancer: A prospective study of 25,892 Norwegian women , 1995, International journal of cancer.

[17]  I. White,et al.  A toolkit for measurement error correction, with a focus on nutritional epidemiology , 2014, Statistics in medicine.

[18]  W. Willett,et al.  Self-Administered Semiquantitative Food Frequency Questionnaires: Patterns, Predictors, and Interpretation of Omitted Items , 2009, Epidemiology.

[19]  Gary Fraser,et al.  Guided Multiple Imputation of Missing Data: Using a Subsample to Strengthen the Missing-at-Random Assumption , 2007, Epidemiology.

[20]  Patrick Royston,et al.  Multiple imputation using chained equations: Issues and guidance for practice , 2011, Statistics in medicine.

[21]  S. Pocock,et al.  The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies , 2007, The Lancet.

[22]  I. White,et al.  Multiple imputation of multiple multi-item scales when a full imputation model is infeasible , 2016, BMC Research Notes.

[23]  Y. Ahn,et al.  Item non-responses in mailed food frequency questionnaires in a Korean male cancer cohort study. , 2006, Asia Pacific journal of clinical nutrition.

[24]  Mark Woodward,et al.  Analysis of the Benefits of a Mediterranean Diet in the GISSI-Prevenzione Study: A Case Study in Imputation of Missing Values from Repeated Measurements , 2005, European Journal of Epidemiology.

[25]  M. Orlich,et al.  Vegetarian dietary patterns and mortality in Adventist Health Study 2. , 2013, JAMA internal medicine.

[26]  T. Byers,et al.  Effects of social approval bias on self-reported fruit and vegetable consumption: a randomized controlled trial , 2008, Nutrition Journal.

[27]  E. Riboli,et al.  Validation and calibration of food-frequency questionnaire measurements in the Northern Sweden Health and Disease cohort , 2002, Public Health Nutrition.

[28]  P. van’t Veer,et al.  Design characteristics of food frequency questionnaires in relation to their validity. , 2007, American journal of epidemiology.

[29]  Raymond J Carroll,et al.  Structure of dietary measurement error: results of the OPEN biomarker study. , 2003, American journal of epidemiology.

[30]  J. Sabaté,et al.  Vegetarian Dietary Patterns Are Associated With a Lower Risk of Metabolic Syndrome , 2011, Diabetes Care.

[31]  M. Galanti,et al.  Diet-Associated Risks of Disease and Self-Reported Food Consumption: How Shall We Treat Partial Nonresponse in a Food Frequency Questionnaire? , 2000, Nutrition and cancer.

[32]  V. Burley,et al.  Food-frequency questionnaires: a review of their design, validation and utilisation , 2004, Nutrition Research Reviews.

[33]  Laurence S Freedman,et al.  Addressing Current Criticism Regarding the Value of Self-Report Dietary Data. , 2015, The Journal of nutrition.