Genome-wide analyses of sparse mediation effects under composite null hypotheses

A genome-wide mediation analysis is conducted to investigate whether epigenetic variations M mediate the effect of socioeconomic adversity S on adiposity Y . The mediation effect can be expressed as a product of two parameters: the S-M association and the M -Y association conditional on S. We show that the joint significance test examining the two parameters separately has smaller p-values than the normality-based or the normal product-based test for the product and is a size α test. However, under multiple tests with sparse signals, the conventional joint significance test has a conservative test size and low power within a study because of the sparsity in signals and not accounting for the composition of different null hypotheses. We develop a novel test assessing the product of two normally distributed test statistics under a composite null hypothesis, where either one parameter is zero or both are zero. We show that the null composition can be adjusted by variances of test statistics without directly estimating proportions of different nulls. Advantages of the new test are illustrated in simulation and the epigenomic study. The new test identifies four methylation loci mediating the socioeconomic effect on adiposity with the false discovery rate less than 20% while existing methods had none surviving this cut-off.

[1]  Wei Zhang,et al.  Estimating and testing high-dimensional mediation effects in epigenetic studies , 2016, Bioinform..

[2]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects , 2005 .

[3]  Adam Drewnowski,et al.  Does social class predict diet quality? , 2008, The American journal of clinical nutrition.

[4]  M. Sobel Asymptotic Confidence Intervals for Indirect Effects in Structural Equation Models , 1982 .

[5]  A. Uitterlinden,et al.  Identification of a novel FGFRL1 MicroRNA target site polymorphism for bone mineral density in meta-analyses of genome-wide association studies. , 2015, Human molecular genetics.

[6]  Eric J. Tchetgen Tchetgen,et al.  On causal mediation analysis with a survival outcome. , 2011 .

[7]  Yen-Tsung Huang,et al.  Integrative modeling of multi‐platform genomic data under the framework of mediation analysis , 2015, Statistics in medicine.

[8]  Tyler J. VanderWeele,et al.  Conceptual issues concerning mediation, interventions and composition , 2009 .

[9]  Chien-Jen Chen,et al.  Fine Particle Pollution, Alanine Transaminase, and Liver Cancer: A Taiwanese Prospective Cohort Study (REVEAL-HBV). , 2016, Journal of the National Cancer Institute.

[10]  John G Bullock,et al.  Yes, But What's the Mechanism? (Don't Expect an Easy Answer) , 2010, Journal of personality and social psychology.

[11]  Richard T. Barfield,et al.  Testing for the indirect effect under the null for genome‐wide mediation analyses , 2017, Genetic epidemiology.

[12]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[13]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[14]  A. Hevener,et al.  Birthweight, mediating biomarkers and the development of type 2 diabetes later in life: a prospective study of multi-ethnic women , 2015, Diabetologia.

[15]  L. Keele,et al.  Identification, Inference and Sensitivity Analysis for Causal Mediation Effects , 2010, 1011.1079.

[16]  T. VanderWeele Bias Formulas for Sensitivity Analysis for Direct and Indirect Effects , 2010, Epidemiology.

[17]  Leo A. Aroian,et al.  The probability function of the product of two normally distributed variables. , 1947 .

[18]  Francesco Marabita,et al.  A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data , 2012, Bioinform..

[19]  Yen-Tsung Huang,et al.  Hypothesis test of mediation effect in causal mediation model with high‐dimensional continuous mediators , 2016, Biometrics.

[20]  C. Townsend,et al.  Reduced food intake and body weight in mice treated with fatty acid synthase inhibitors. , 2000, Science.

[21]  R. Berger,et al.  Bioequivalence trials, intersection-union tests and equivalence confidence sets , 1996 .

[22]  C. Eaton,et al.  Epigenome-wide profiling of DNA methylation in paired samples of adipose tissue and blood , 2016, Epigenetics.

[23]  Judea Pearl,et al.  Direct and Indirect Effects , 2001, UAI.

[24]  C. Eaton,et al.  Adiposity is associated with DNA methylation profile in adipose tissue. , 2015, International journal of epidemiology.

[25]  Andrew G Engel,et al.  Mutations in ZASP define a novel form of muscular dystrophy in humans , 2005, Annals of neurology.

[26]  Stijn Vansteelandt,et al.  Odds ratios for mediation analysis for a dichotomous outcome. , 2010, American journal of epidemiology.

[27]  S. West,et al.  A comparison of methods to test mediation and other intervening variable effects. , 2002, Psychological methods.

[28]  Tyler J VanderWeele,et al.  A three-way decomposition of a total effect into direct, indirect, and interactive effects. , 2013, Epidemiology.

[29]  B. Lindqvist,et al.  Estimating the proportion of true null hypotheses, with application to DNA microarray data , 2005 .

[30]  E. Loucks,et al.  Associations between childhood socioeconomic position and adulthood obesity. , 2009, Epidemiologic reviews.

[31]  W. McArdle,et al.  Associations with early-life socio-economic position in adult DNA methylation. , 2012, International journal of epidemiology.

[32]  J. Brug,et al.  A systematic review of studies on socioeconomic inequalities in dietary intakes associated with weight gain and overweight/obesity conducted among European adults , 2009, Obesity reviews : an official journal of the International Association for the Study of Obesity.

[33]  D. Mackinnon Introduction to Statistical Mediation Analysis , 2008 .

[34]  C. Eaton,et al.  Epigenetic Mediators Between Childhood Socioeconomic Disadvantage and Mid-Life Body Mass Index: The New England Family Study , 2016, Psychosomatic medicine.

[35]  Yen-Tsung Huang,et al.  Mediation analysis for survival data using semiparametric probit models , 2016, Biometrics.

[36]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[37]  L. Monserrat,et al.  A mutation in the Z‐line Cypher/ZASP protein is associated with arrhythmogenic right ventricular cardiomyopathy , 2015, Clinical genetics.

[38]  Tyler J VanderWeele,et al.  Causal Mediation Analysis With Survival Data , 2011, Epidemiology.

[39]  Theis Lange,et al.  Direct and Indirect Effects in a Survival Context , 2011, Epidemiology.