Non-linear mediation analysis with high-dimensional mediators whose causal structure is unknown.

With multiple possible mediators on the causal pathway from a treatment to an outcome, we consider the problem of decomposing the effects along multiple possible causal path(s) through each distinct mediator. Under Pearl's path-specific effects framework (Pearl, 2001; Avin et al., 2005), such fine-grained decompositions necessitate stringent assumptions, such as correctly specifying the causal structure among the mediators, and no unobserved confounding among the mediators. In contrast, interventional direct and indirect effects for multiple mediators (Vansteelandt and Daniel, 2017) can be identified under much weaker conditions, while providing scientifically relevant causal interpretations. Nonetheless, current estimation approaches require (correctly) specifying a model for the joint mediator distribution, which can be difficult when there is a high-dimensional set of possibly continuous and non-continuous mediators. In this article, we avoid the need to model this distribution, by developing a definition of interventional effects previously suggested by VanderWeele and Tchetgen Tchetgen (2017) for longitudinal mediation. We propose a novel estimation strategy that uses non-parametric estimates of the (counterfactual) mediator distributions. Non-continuous outcomes can be accommodated using non-linear outcome models. Estimation proceeds via Monte Carlo integration. The procedure is illustrated using publicly available genomic data (Huang and Pan, 2016) to assess the causal effect of a microRNA expression on the three-month mortality of brain cancer patients that is potentially mediated by expression values of multiple genes.

[1]  Chen Avin,et al.  Identifiability of Path-Specific Effects , 2005, IJCAI.

[2]  James M. Robins,et al.  Effect decomposition in the presence of an exposure-induced mediator-outcome confounder. , 2014, Epidemiology.

[3]  T J VanderWeele,et al.  Mediation Analysis with Multiple Mediators , 2014, Epidemiologic methods.

[4]  D. Mackinnon Contrasts in multiple mediator models. , 2000 .

[5]  J. Pearl The Causal Mediation Formula—A Guide to the Assessment of Pathways and Mechanisms , 2012, Prevention Science.

[6]  T. VanderWeele,et al.  Interventional Approach for Path-Specific Effects , 2017 .

[7]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[8]  Stijn Vansteelandt,et al.  Interventional Effects for Mediation Analysis with Multiple Mediators. , 2017, Epidemiology.

[9]  Judea Pearl,et al.  Direct and Indirect Effects , 2001, UAI.

[10]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[11]  L. Keele,et al.  A General Approach to Causal Mediation Analysis , 2010, Psychological methods.

[12]  J. Lok Causal organic direct and indirect effects: closer to Baron and Kenny , 2019, 1903.04697.

[13]  D. Mackinnon,et al.  The Correspondence Between Causal and Traditional Mediation Analysis: the Link Is the Mediator by Treatment Interaction , 2019, Prevention Science.

[14]  Joshua N. Sampson,et al.  FWER and FDR control when testing multiple mediators , 2018, Bioinform..

[15]  A. Belloni,et al.  Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011, 1201.0224.

[16]  A. Philip Dawid,et al.  Direct and Indirect Effects of Sequential Treatments , 2006, UAI.

[17]  Jeffrey M Albert,et al.  Generalized causal mediation and path analysis: Extensions and practical considerations , 2018, Statistical methods in medical research.

[18]  T. VanderWeele,et al.  Decomposition Analysis to Identify Intervention Targets for Reducing Disparities , 2017, Epidemiology.

[19]  T. VanderWeele,et al.  Mediation analysis with time varying exposures and mediators , 2017, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[20]  Kristopher J Preacher,et al.  Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models , 2008, Behavior research methods.

[21]  Tyler J. VanderWeele,et al.  On the Causal Interpretation of Race , 2014 .

[22]  S. Vansteelandt,et al.  Causal Mediation Analysis with Multiple Mediators , 2014, Biometrics.

[23]  George B. Ploubidis,et al.  Mediation Analysis With Intermediate Confounding: Structural Equation Modeling Viewed Through the Causal Inference Lens , 2014, American journal of epidemiology.

[24]  Louisa H Smith,et al.  Mediational E-values: Approximate sensitivity analysis for unmeasured mediator-outcome confounding. , 2019, Epidemiology.

[25]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[26]  J. Carlin,et al.  Understanding Interventional Effects: A More Natural Approach to Mediation Analysis? , 2018, Epidemiology.

[27]  Ilya Shpitser,et al.  Counterfactual Graphical Models for Longitudinal Mediation Analysis With Unobserved Confounding , 2012, Cogn. Sci..

[28]  Hongzhe Li,et al.  Inference for Individual Mediation Effects and Interventional Effects in Sparse High-Dimensional Causal Graphical Models , 2018, 1809.10652.

[29]  T. VanderWeele Policy-relevant proportions for direct effects. , 2013, Epidemiology.

[30]  Kara E. Rudolph,et al.  Nonparametric efficient causal mediation with intermediate confounders , 2019, 1912.09936.

[31]  Stijn Vansteelandt,et al.  Imputation Strategies for the Estimation of Natural Direct and Indirect Effects , 2012 .

[32]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[33]  Masataka Taguri,et al.  Causal mediation analysis with multiple causally non-ordered mediators , 2018, Statistical methods in medical research.

[34]  Wei Zhang,et al.  Estimating and testing high-dimensional mediation effects in epigenetic studies , 2016, Bioinform..

[35]  S. Vansteelandt,et al.  Flexible Mediation Analysis in the Presence of Nonlinear Relations: Beyond the Mediation Formula , 2013, Multivariate behavioral research.

[36]  Yen-Tsung Huang,et al.  Hypothesis test of mediation effect in causal mediation model with high‐dimensional continuous mediators , 2016, Biometrics.

[37]  Karl T Kelsey,et al.  Integrative Analysis of Micro‐RNA, Gene Expression, and Survival of Glioblastoma Multiforme , 2015, Genetic epidemiology.

[38]  Daniel L. Oberski,et al.  Exploratory Mediation Analysis with Many Potential Mediators , 2018, Structural Equation Modeling: A Multidisciplinary Journal.

[39]  Ian Schmid,et al.  Clarifying causal mediation analysis for the applied researcher: Defining effects based on what we want to learn. , 2019, Psychological methods.

[40]  S. Vansteelandt,et al.  Flexible Mediation Analysis With Multiple Mediators. , 2017, American journal of epidemiology.

[41]  Beatrijs Moerkerke,et al.  Disentangling indirect effects through multiple mediators without assuming any causal structure among the mediators. , 2019, Psychological methods.

[42]  David Benkeser,et al.  Nonparametric inference for interventional effects with multiple mediators , 2020, 2001.06027.

[43]  L. Thygesen,et al.  Assessing natural direct and indirect effects through multiple pathways. , 2014, American journal of epidemiology.

[44]  Xi Luo,et al.  Pathway Lasso: Estimate and Select Sparse Mediation Pathways with High Dimensional Mediators , 2016, 1603.07749.

[45]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[46]  Brian S. Caffo,et al.  Multimodal neuroimaging data integration and pathway analysis , 2019, Biometrics.

[47]  J. Pearl,et al.  Confounding and Collapsibility in Causal Inference , 1999 .

[48]  J. Böhnke Explanation in causal inference: Methods for mediation and interaction. , 2016, Quarterly journal of experimental psychology.

[49]  Martin A. Lindquist,et al.  Sparse principal component based high-dimensional mediation analysis , 2018, Comput. Stat. Data Anal..

[50]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[51]  J. Sampson,et al.  Group testing in mediation analysis , 2020, Statistics in medicine.