Bayesian shrinkage estimation of high dimensional causal mediation effects in omics studies.

Causal mediation analysis aims to examine the role of a mediator or a group of mediators that lie in the pathway between an exposure and an outcome. Recent biomedical studies often involve a large number of potential mediators based on high-throughput technologies. Most of the current analytic methods focus on settings with one or a moderate number of potential mediators. With the expanding growth of -omics data, joint analysis of molecular-level genomics data with epidemiological data through mediation analysis is becoming more common. However, such joint analysis requires methods that can simultaneously accommodate high-dimensional mediators and that are currently lacking. To address this problem, we develop a Bayesian inference method using continuous shrinkage priors to extend previous causal mediation analysis techniques to a high-dimensional setting. Simulations demonstrate that our method improves the power of global mediation analysis compared to simpler alternatives and has decent performance to identify true non-null contributions to the mediation effects of the pathway. The Bayesian method also helps us to understand the structure of the composite null cases for inactive mediators in the pathway. We applied our method to Multi-Ethnic Study of Atherosclerosis (MESA) and identified DNA methylation regions that may actively mediate the effect of socioeconomic status (SES) on cardiometabolic outcomes. This article is protected by copyright. All rights reserved.

[1]  Yen-Tsung Huang,et al.  Hypothesis test of mediation effect in causal mediation model with high‐dimensional continuous mediators , 2016, Biometrics.

[2]  A. Baccarelli,et al.  The Role of DNA Methylation in Cardiovascular Risk and Disease: Methodological Aspects, Study Design, and Data Analysis for Epidemiological Studies. , 2016, Circulation research.

[3]  T J VanderWeele,et al.  Mediation Analysis with Multiple Mediators , 2014, Epidemiologic methods.

[4]  Bhramar Mukherjee,et al.  Life course socioeconomic status and DNA methylation in genes related to stress reactivity and inflammation: The multi-ethnic study of atherosclerosis , 2015, Epigenetics.

[5]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[6]  Michael J. Daniels,et al.  A framework for Bayesian nonparametric inference for causal effects of mediation , 2017, Biometrics.

[7]  M. Stephens,et al.  Integrated Enrichment Analysis of Variants and Pathways in Genome-Wide Association Studies Indicates Central Role for IL-2 Signaling Genes in Type 1 Diabetes, and Cytokine Signaling Genes in Crohn's Disease , 2013, PLoS genetics.

[8]  Michael R Elliott,et al.  Bayesian inference for causal mediation effects using principal stratification with dichotomous mediators and outcomes. , 2010, Biostatistics.

[9]  Stijn Vansteelandt,et al.  Odds ratios for mediation analysis for a dichotomous outcome. , 2010, American journal of epidemiology.

[10]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[11]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[12]  J. Pankow,et al.  Glycated hemoglobin, diabetes, and cardiovascular risk in nondiabetic adults. , 2010, The New England journal of medicine.

[13]  T. VanderWeele Mediation Analysis: A Practitioner's Guide. , 2016, Annual review of public health.

[14]  Yang Ni,et al.  Polygenic prediction via Bayesian regression and continuous shrinkage priors , 2018, Nature Communications.

[15]  S. Vansteelandt,et al.  Causal Mediation Analysis with Multiple Mediators , 2014, Biometrics.

[16]  Elizabeth L. Ogburn,et al.  High-dimensional multivariate mediation with application to neuroimaging data. , 2015, Biostatistics.

[17]  D. Rubin Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment , 1980 .

[18]  Xiang Zhou,et al.  Polygenic Modeling with Bayesian Sparse Linear Mixed Models , 2012, PLoS genetics.

[19]  H. Kuo,et al.  Association Among Acrylamide, Blood Insulin, and Insulin Resistance in Adults , 2009, Diabetes Care.

[20]  Bhramar Mukherjee,et al.  Neighborhood characteristics influence DNA methylation of genes involved in stress response and inflammation: The Multi-Ethnic Study of Atherosclerosis , 2017, Epigenetics.

[21]  Corwin M Zigler,et al.  BAYESIAN METHODS FOR MULTIPLE MEDIATORS: RELATING PRINCIPAL STRATIFICATION AND CAUSAL MEDIATION IN THE ANALYSIS OF POWER PLANT EMISSION CONTROLS. , 2019, The annals of applied statistics.

[22]  R. Kronmal,et al.  Multi-Ethnic Study of Atherosclerosis: objectives and design. , 2002, American journal of epidemiology.

[23]  L. Keele,et al.  Identification, Inference and Sensitivity Analysis for Causal Mediation Effects , 2010, 1011.1079.

[24]  Masataka Taguri,et al.  Causal mediation analysis with multiple causally non-ordered mediators , 2018, Statistical methods in medical research.

[25]  M. Stephens,et al.  Bayesian variable selection regression for genome-wide association studies and other large-scale problems , 2011, 1110.6019.

[26]  Tyler J VanderWeele,et al.  Principal Stratification -- Uses and Limitations , 2011, The international journal of biostatistics.

[27]  Marshall M Joffe,et al.  A review of causal estimation of effects in mediation analyses , 2012, Statistical methods in medical research.

[28]  Debbie A Lawlor,et al.  Association Analysis of 29,956 Individuals Confirms That a Low-Frequency Variant at CCND2 Halves the Risk of Type 2 Diabetes by Enhancing Insulin Secretion , 2015, Diabetes.

[29]  Ting-Huei Chen,et al.  High dimensional mediation analysis with latent variables , 2019, Biometrics.