Theory and performance of substitution models for estimating relative causal effects in nutritional epidemiology

ABSTRACT Background Estimating relative causal effects (i.e., “substitution effects”) is a common aim of nutritional research. In observational data, this is usually attempted using 1 of 2 statistical modeling approaches: the leave-one-out model and the energy partition model. Despite their widespread use, there are concerns that neither approach is well understood in practice. Objectives We aimed to explore and illustrate the theory and performance of the leave-one-out and energy partition models for estimating substitution effects in nutritional epidemiology. Methods Monte Carlo data simulations were used to illustrate the theory and performance of both the leave-one-out model and energy partition model, by considering 3 broad types of causal effect estimands: 1) direct substitutions of the exposure with a single component, 2) inadvertent substitutions of the exposure with several components, and 3) average relative causal effects of the exposure instead of all other dietary sources. Models containing macronutrients, foods measured in calories, and foods measured in grams were all examined. Results The leave-one-out and energy partition models both performed equally well when the target estimand involved substituting a single exposure with a single component, provided all variables were measured in the same units. Bias occurred when the substitution involved >1 substituting component. Leave-one-out models that examined foods in mass while adjusting for total energy intake evaluated obscure estimands. Conclusions Regardless of the approach, substitution models need to be constructed from clearly defined causal effect estimands. Estimands involving a single exposure and a single substituting component are typically estimated more accurately than estimands involving more complex substitutions. The practice of examining foods measured in grams or portions while adjusting for total energy intake is likely to deliver obscure relative effect estimands with unclear interpretations.

[1]  Kellyn F Arnold,et al.  Adjustment for energy intake in nutritional research: a causal inference perspective , 2021, medRxiv.

[2]  Kellyn F Arnold,et al.  Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations , 2020, International journal of epidemiology.

[3]  E. Rimm,et al.  Food substitution models for nutritional epidemiology. , 2020, The American journal of clinical nutrition.

[4]  E. Murray,et al.  Compositional data call for complex interventions. , 2020, International journal of epidemiology.

[5]  Kellyn F Arnold,et al.  A causal inference perspective on the analysis of compositional data , 2020, International journal of epidemiology.

[6]  A. Goto,et al.  Association of soy and fermented soy product intake with total and cause specific mortality: prospective cohort study , 2020, BMJ.

[7]  K. Overvad,et al.  Statistical models in nutritional epidemiology: more focus on the interpretation and argumentation for variable selection. , 2019, The American journal of clinical nutrition.

[8]  S. Schlesinger,et al.  Dietary sugars and cardiometabolic risk factors: a network meta-analysis on isocaloric substitution interventions. , 2019, The American journal of clinical nutrition.

[9]  Dorothy D. Sears,et al.  Sugar-sweetened beverages and colorectal cancer risk in the California Teachers Study , 2019, PloS one.

[10]  M. Touvier,et al.  Sugary drink consumption and risk of cancer: results from NutriNet-Santé prospective cohort , 2019, BMJ.

[11]  E. Rimm,et al.  Association of changes in red meat consumption with total and cause specific mortality among US women and men: two prospective cohort studies , 2019, BMJ.

[12]  M. Touvier,et al.  Ultra-processed food intake and risk of cardiovascular disease: prospective cohort study (NutriNet-Santé) , 2019, BMJ.

[13]  E. Giovannucci,et al.  Substitution analysis in nutritional epidemiology: proceed with caution , 2018, European Journal of Epidemiology.

[14]  Maciej Liskiewicz,et al.  Robust causal inference using Directed Acyclic Graphs: the R package , 2018 .

[15]  W. Willett,et al.  Isotemporal substitution paradigm for physical activity epidemiology and weight change. , 2009, American journal of epidemiology.

[16]  A. Minihane,et al.  Update on trans fatty acids and health: Position statement by the Scientific Advisory Committee on Nutrition (SACN) , 2007 .

[17]  D Spiegelman,et al.  Dietary fat and coronary heart disease: a comparison of approaches for adjusting for total energy intake and modeling repeated dietary measurements. , 1999, American journal of epidemiology.

[18]  W C Willett,et al.  Adjustment for total energy intake in epidemiologic studies. , 1997, The American journal of clinical nutrition.

[19]  S Wacholder,et al.  Interpretation of energy adjustment models for nutritional epidemiology. , 1993, American journal of epidemiology.

[20]  V. Saliba,et al.  Public Health in England. , 1988, Public health.

[21]  M. Singer,et al.  Nutritional Epidemiology , 2020, Definitions.

[22]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .