Variance component tests of multivariate mediation effects under composite null hypotheses

Mediation effects of multiple mediators are determined by two associations: one between an exposure and mediators ( S - M ) and the other between the mediators and an outcome conditional on the exposure ( M - Y ). The test for mediation effects is conducted under a composite null hypothesis, that is, either one of the S - M and M - Y associations is zero or both are zeros. Without accounting for the composite null, the type 1 error rate within a study containing a large number of multimediator tests may be much less than the expected. We propose a novel test to address the issue. For each mediation test j , j = 1 , … , J , we examine the S - M and M - Y associations using two separate variance component tests. Assuming a zero-mean working distribution with a common variance for the element-wise S - M (and M - Y ) associations, score tests for the variance components are constructed. We transform the test statistics into two normally distributed statistics under the null. Using a recently developed result, we conduct J hypothesis tests accounting for the composite null hypothesis by adjusting for the variances of the normally distributed statistics for the S - M and M - Y associations. Advantages of the proposed test over other methods are illustrated in simulation studies and a data application where we analyze lung cancer data from The Cancer Genome Atlas to investigate the smoking effect on gene expression through DNA methylation in 15 114 genes.

[1]  Tyler J. VanderWeele,et al.  Conceptual issues concerning mediation, interventions and composition , 2009 .

[2]  Richard T. Barfield,et al.  Testing for the indirect effect under the null for genome‐wide mediation analyses , 2017, Genetic epidemiology.

[3]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[4]  Xihong Lin Variance component testing in generalised linear models with random effects , 1997 .

[5]  T J VanderWeele,et al.  Mediation Analysis with Multiple Mediators , 2014, Epidemiologic methods.

[6]  Yen-Tsung Huang,et al.  Hypothesis test of mediation effect in causal mediation model with high‐dimensional continuous mediators , 2016, Biometrics.

[7]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[8]  Tyler J VanderWeele,et al.  Causal Mediation Analysis With Survival Data , 2011, Epidemiology.

[9]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[10]  Theis Lange,et al.  Direct and Indirect Effects in a Survival Context , 2011, Epidemiology.

[11]  S. West,et al.  A comparison of methods to test mediation and other intervening variable effects. , 2002, Psychological methods.

[12]  Satterthwaite Fe An approximate distribution of estimates of variance components. , 1946 .

[13]  Xihong Lin,et al.  JOINT ANALYSIS OF SNP AND GENE EXPRESSION DATA IN GENETIC ASSOCIATION STUDIES OF COMPLEX DISEASES. , 2014, The annals of applied statistics.

[14]  B. Lindqvist,et al.  Estimating the proportion of true null hypotheses, with application to DNA microarray data , 2005 .

[15]  D. Mackinnon Introduction to Statistical Mediation Analysis , 2008 .

[16]  Hongzhe Li,et al.  Web-based Supplementary Materials for “ More powerful genetic association testing via a new statistical framework for integrative genomics ” , 2014 .

[17]  Yen-Tsung Huang,et al.  Genome-wide analyses of sparse mediation effects under composite null hypotheses , 2019, The Annals of Applied Statistics.

[18]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[19]  Stijn Vansteelandt,et al.  Odds ratios for mediation analysis for a dichotomous outcome. , 2010, American journal of epidemiology.

[20]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[21]  R. Davies The distribution of a linear combination of 2 random variables , 1980 .

[22]  Yen-Tsung Huang,et al.  Mediation analysis for survival data using semiparametric probit models , 2016, Biometrics.

[23]  Z. Ying,et al.  A resampling method based on pivotal estimating functions , 1994 .

[24]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[25]  Tianxi Cai,et al.  Semiparametric regression analysis for clustered failure time data , 2000 .

[26]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects , 2005 .

[27]  L. Keele,et al.  Identification, Inference and Sensitivity Analysis for Causal Mediation Effects , 2010, 1011.1079.

[28]  S. Vansteelandt,et al.  Causal Mediation Analysis with Multiple Mediators , 2014, Biometrics.

[29]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[30]  Yen-Tsung Huang Joint significance tests for mediation effects of socioeconomic adversity on adiposity via epigenetics , 2018, Annals of Applied Statistics.

[31]  R. Berger,et al.  Bioequivalence trials, intersection-union tests and equivalence confidence sets , 1996 .