Exploratory Mediation Analysis with Many Potential Mediators

Social and behavioral scientists are increasingly employing technologies such as fMRI, smartphones, and gene sequencing, which yield ‘high-dimensional’ datasets with more columns than rows. There is increasing interest, but little substantive theory, in the role the variables in these data play in known processes. This necessitates exploratory mediation analysis, for which structural equation modeling is the benchmark method. However, this method cannot perform mediation analysis with more variables than observations. One option is to run a series of univariate mediation models, which incorrectly assumes independence of the mediators. Another option is regularization, but the available implementations may lead to high false-positive rates. In this article, we develop a hybrid approach which uses components of both filter and regularization: the ‘Coordinate-wise Mediation Filter’. It performs filtering conditional on the other selected mediators. We show through simulation that it improves performance over existing methods. Finally, we provide an empirical example, showing how our method may be used for epigenetic research.

[1]  Jian Huang,et al.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION. , 2011, The annals of applied statistics.

[2]  John J. McArdle,et al.  Regularized Structural Equation Modeling , 2015, Multivariate behavioral research.

[3]  Joshua N. Sampson,et al.  Testing multiple biological mediators simultaneously , 2014, Bioinform..

[4]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[5]  Xi Luo,et al.  Pathway Lasso: Estimate and Select Sparse Mediation Pathways with High Dimensional Mediators , 2016, 1603.07749.

[6]  Andreas M. Brandmaier,et al.  Variable selection in structural equation models with regularized MIMIC Models , 2018 .

[7]  T J VanderWeele,et al.  Mediation Analysis with Multiple Mediators , 2014, Epidemiologic methods.

[8]  Kristopher J Preacher,et al.  Advances in mediation analysis: a survey and synthesis of new developments. , 2015, Annual review of psychology.

[9]  Kristopher J Preacher,et al.  Statistical mediation analysis with a multicategorical independent variable. , 2014, The British journal of mathematical and statistical psychology.

[10]  Daniel L. Oberski,et al.  Shrinkage priors for Bayesian penalized regression , 2018 .

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Peter Richtárik,et al.  Iteration complexity of randomized block-coordinate descent methods for minimizing a composite function , 2011, Mathematical Programming.

[13]  Yves Rosseel,et al.  lavaan: An R Package for Structural Equation Modeling , 2012 .

[14]  Michael Braun sparseMVN: An R Package for Multivariate Normal Distributions with Sparse Covariance or Precision Matrices , 2013 .

[15]  Christoph Lange,et al.  PolyGEE: a generalized estimating equation approach to the efficient and robust estimation of polygenic effects in large-scale association studies , 2017, Biostatistics.

[16]  Kristopher J Preacher,et al.  Please Scroll down for Article Multivariate Behavioral Research Quantifying and Testing Indirect Effects in Simple Mediation Models When the Constituent Paths Are Nonlinear , 2022 .

[17]  Wei Zhang,et al.  Estimating and testing high-dimensional mediation effects in epigenetic studies , 2016, Bioinform..

[18]  Carl F. Falk,et al.  Model Specification Searches in Structural Equation Modeling with R , 2018 .

[19]  Martin J. Aryee,et al.  Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in Rheumatoid Arthritis , 2013, Nature Biotechnology.

[20]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[21]  Elizabeth L. Ogburn,et al.  High-dimensional multivariate mediation with application to neuroimaging data. , 2015, Biostatistics.

[22]  Martin A. Lindquist,et al.  Brain mediators of the effects of noxious heat on pain , 2014, PAIN®.

[23]  D. Mackinnon Introduction to Statistical Mediation Analysis , 2008 .

[24]  M. Sobel Some New Results on Indirect Effects and Their Standard Errors in Covariance Structure Models , 1986 .

[25]  S. West,et al.  A comparison of methods to test mediation and other intervening variable effects. , 2002, Psychological methods.

[26]  Kristopher J Preacher,et al.  Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models , 2008, Behavior research methods.

[27]  Jan R. Böhnke,et al.  Book Review: Explanation in Causal Inference: Methods for Mediation and Interaction , 2016, Quarterly journal of experimental psychology.

[28]  J. Horn A rationale and test for the number of factors in factor analysis , 1965, Psychometrika.

[29]  Kai Zhang,et al.  A Feature Sampling Strategy for Analysis of High Dimensional Genomic Data , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[30]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[31]  K. P. Singh,et al.  Support vector machines in water quality management. , 2011, Analytica chimica acta.

[32]  Sida I. Wang,et al.  Dropout Training as Adaptive Regularization , 2013, NIPS.

[33]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[34]  Trevor Hastie,et al.  Statistical Learning with Sparsity: The Lasso and Generalizations , 2015 .

[35]  Sarfaraz Serang,et al.  Exploratory Mediation Analysis via Regularization , 2017, Structural equation modeling : a multidisciplinary journal.

[36]  Tyler J. VanderWeele,et al.  Explanation in Causal Inference: Methods for Mediation and Interaction , 2015 .

[37]  René S. Kahn,et al.  Genome-wide DNA methylation levels and altered cortisol stress reactivity following childhood trauma in humans , 2016, Nature Communications.

[38]  M. McCloskey,et al.  Exploratory analysis of mediators of the relationship between childhood maltreatment and suicidal behavior. , 2018, Journal of adolescence.

[39]  Yurii Nesterov,et al.  Efficiency of Coordinate Descent Methods on Huge-Scale Optimization Problems , 2012, SIAM J. Optim..

[40]  David P Mackinnon,et al.  Confidence Limits for the Indirect Effect: Distribution of the Product and Resampling Methods , 2004, Multivariate behavioral research.

[41]  Matthew S. Fritz,et al.  Mediation analysis. , 2019, Annual review of psychology.

[42]  Tsviya Olender,et al.  GeneCardsTM 2002: towards a complete, object-oriented, human gene compendium , 2002, Bioinform..