Estimation of Mediation Effect for High-dimensional Omics Mediators with Application to the Framingham Heart Study

Environmental exposures can regulate intermediate molecular phenotypes, such as gene expression, by different mechanisms and thereby lead to various health outcomes. It is of significant scientific interest to unravel the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposure and traits. Mediation analysis is an important tool for investigating such relationships. However, it has mainly focused on low-dimensional settings, and there is a lack of a good measure of the total mediation effect. Here, we extend an R-squared (Rsq) effect size measure, originally proposed in the single-mediator setting, to the moderate- and high-dimensional mediator settings in the mixed model framework. Based on extensive simulations, we compare our measure and estimation procedure with several frequently used mediation measures, including product, proportion, and ratio measures. Our Rsq measure has small bias and variance under the correctly specified model. To mitigate potential bias induced by non-mediators, we examine two variable selection procedures, i.e., iterative sure independence screening and false discovery rate control, to exclude the non-mediators. We evaluate the consistency of the proposed estimation procedures and introduce a resampling-based confidence interval. By applying the proposed estimation procedure, we find that more than half of the aging-related variations in systolic blood pressure can be explained by gene expression profiles in the Framingham Heart Study.

[1]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[2]  Reese Baltes The Complex Nature of Unique and Shared Effects in Hierarchical Linear Regression : Implications for Developmental Psychology , 2001 .

[3]  T. VanderWeele Mediation Analysis: A Practitioner's Guide. , 2016, Annual review of public health.

[4]  Tanya M. Teslovich,et al.  The Influence of Age and Sex on Genetic Associations with Adult Body Size and Shape: A Large-Scale Genome-Wide Interaction Study , 2015, PLoS Genetics.

[5]  Kosuke Imai,et al.  Identification and Sensitivity Analysis for Multiple Causal Mechanisms: Revisiting Evidence from Framing Experiments , 2013, Political Analysis.

[6]  T J VanderWeele,et al.  Mediation Analysis with Multiple Mediators , 2014, Epidemiologic methods.

[7]  Andrew D. Johnson,et al.  Gene expression analysis of whole blood, peripheral blood mononuclear cells, and lymphoblastoid cell lines from the Framingham Heart Study. , 2012, Physiological genomics.

[8]  Kristopher J Preacher,et al.  Effect size measures for mediation models: quantitative strategies for communicating indirect effects. , 2011, Psychological methods.

[9]  C. McCulloch,et al.  Misspecifying the Shape of a Random Effects Distribution: Why Getting It Wrong May Not Matter , 2011, 1201.1980.

[10]  David P. MacKinnon,et al.  R2 effect-size measures for mediation analysis , 2009, Behavior research methods.

[11]  May E. Montasser,et al.  DNA Methylation Analysis Identifies Loci for Blood Pressure Regulation. , 2017, American journal of human genetics.

[12]  L. Wain,et al.  Molecular mechanisms underlying variations in lung function: a systems genetics analysis. , 2015, The Lancet. Respiratory medicine.

[13]  Richard Weindruch,et al.  Gene expression profiling of aging using DNA microarrays , 2002, Mechanisms of Ageing and Development.

[14]  Christian P. Robert,et al.  Large-scale inference , 2010 .

[15]  C. Ladd-Acosta,et al.  The role of epigenetics in genetic and environmental epidemiology. , 2016, Epigenomics.

[16]  João Pedro de Magalhães,et al.  Meta-analysis of age-related gene expression profiles identifies common signatures of aging , 2009, Bioinform..

[17]  D. Mackinnon,et al.  Statistical properties of four effect-size measures for mediation models , 2017, Behavior Research Methods.

[18]  D. Mackinnon Introduction to Statistical Mediation Analysis , 2008 .

[19]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[20]  Albert Hofman,et al.  Nucleotide Excision DNA Repair Is Associated With Age-Related Vascular Dysfunction , 2012, Circulation.

[21]  G. Torre-Amione Immune activation in chronic heart failure. , 2005, The American journal of cardiology.

[22]  Han Chen,et al.  A powerful and data‐adaptive test for rare‐variant–based gene‐environment interaction analysis , 2018, Statistics in medicine.

[23]  W Y Zhang,et al.  Discussion on `Sure independence screening for ultra-high dimensional feature space' by Fan, J and Lv, J. , 2008 .

[24]  Lei Sun,et al.  Reduction of selection bias in genomewide studies by resampling , 2005, Genetic epidemiology.

[25]  Kim Nimon,et al.  Using commonality analysis in multiple regressions: a tool to decompose regression effects in the face of multicollinearity , 2014 .

[26]  Andrew D. Johnson,et al.  A Meta-analysis of Gene Expression Signatures of Blood Pressure and Hypertension , 2015, PLoS genetics.

[27]  E. Colicino,et al.  DNA methylation in blood as a mediator of the association of mid-childhood body mass index with cardio-metabolic risk score in early adolescence , 2018, Epigenetics.

[28]  G. Verbeke,et al.  The effect of misspecifying the random-effects distribution in linear mixed models for longitudinal data , 1997 .

[29]  E. Kovacs,et al.  Clinical Interventions in Aging Dovepress the Aging Lung , 2022 .

[30]  Yen-Tsung Huang,et al.  Hypothesis test of mediation effect in causal mediation model with high‐dimensional continuous mediators , 2016, Biometrics.

[31]  Noel A Cressie,et al.  The asymptotic distribution of REML estimators , 1993 .

[32]  Robert D. McPhee,et al.  Commonality Analysis: A Method for Decomposing Explained Variance in Multiple Regression Analyses. , 1979 .

[33]  Wei Zhang,et al.  Estimating and testing high-dimensional mediation effects in epigenetic studies , 2016, Bioinform..

[34]  Dennis L. Sun,et al.  Exact post-selection inference, with application to the lasso , 2013, 1311.6238.

[35]  Taylor J. Maxwell,et al.  A Family‐Based Joint Test for Mean and Variance Heterogeneity for Quantitative Traits , 2015, Annals of human genetics.

[36]  Joshua N. Sampson,et al.  Testing multiple biological mediators simultaneously , 2014, Bioinform..