A Framework for Mediation Analysis with Multiple Exposures, Multivariate Mediators, and Non-Linear Response Models

Mediation analysis seeks to identify and quantify the paths by which an exposure affects an outcome. Intermediate variables that are affected by the exposure and that in turn affect the outcome are known as mediators. Extensive work exists on mediation analysis for models with a single mediator and continuous or binary outcomes. However, these methods are often unsuitable for multi-omic data, which include highly interconnected variables measuring biological mechanisms and varied types of outcome variables such as censored survival responses. In this article, we develop a general framework for causal mediation analysis with multiple exposures, multivariate mediators, and continuous, binary, and survival responses. We estimate mediation effects on several scales, including the mean difference, odds ratio, and restricted mean scales, as appropriate for the outcome model. Our estimation method avoids imposing constraints on model parameters, such as the rare disease assumption, while accommodating continuous exposures. We evaluate the framework and compare it to other methods in extensive simulation studies by assessing bias, type I error, and power across a range of sample sizes, disease prevalences, and numbers of false mediators. Using Kidney Renal Clear Cell Carcinoma data from The Cancer Genome Atlas, we identify proteins that mediate the effect of metabolic gene expression on survival. Software implementing this unified framework is available in an R package (this https URL).
