Co-occurring expression and methylation QTLs allow detection of common causal variants and shared biological mechanisms

Inherited genetic variation affects local gene expression and DNA methylation in humans. Most expression quantitative trait loci (cis-eQTLs) occur at the same genomic location as a methylation QTL (cis-meQTL), suggesting a common causal variant and shared mechanism. Using DNA and RNA from peripheral blood of Bangladeshi individuals, here we use co-localization methods to identify eQTL-meQTL pairs likely to share a causal variant. We use partial correlation and mediation analyses to identify >400 of these pairs showing evidence of a causal relationship between expression and methylation (i.e., shared mechanism) with many additional pairs we are underpowered to detect. These co-localized pairs are enriched for SNPs showing opposite associations with expression and methylation, although many SNPs affect multiple CpGs in opposite directions. This work demonstrates the pervasiveness of co-regulated expression and methylation in the human genome. Applying this approach to other types of molecular QTLs can enhance our understanding of regulatory mechanisms.Most expression QTLs (eQTLs) co-occur with a DNA methylation QTL (meQTL), suggesting a common causal variant. Here the authors analyse DNA and RNA from blood and identify eQTL-meQTL pairs likely to share a causal variant, finding that expression and methylation are often genetically co-regulated.

[1]  F. Parvez,et al.  Arsenic exposure from drinking water, and all-cause and chronic-disease mortalities in Bangladesh (HEALS): a prospective cohort study , 2010, The Lancet.

[2]  Casey S. Greene,et al.  International genome-wide meta-analysis identifies new primary biliary cirrhosis risk loci and targetable pathogenic pathways , 2015, Nature Communications.

[3]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[4]  Mark I. McCarthy,et al.  Global analysis of DNA methylation variation in adipose tissue from twins reveals links to disease-associated variants in distal regulatory elements. , 2013, American journal of human genetics.

[5]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[6]  Jun S. Liu,et al.  Genetics of rheumatoid arthritis contributes to biology and drug discovery , 2013 .

[7]  L. Kruglyak,et al.  The role of regulatory variation in complex traits and disease , 2015, Nature Reviews Genetics.

[8]  Wen Zhang,et al.  A Bayesian Framework for Multiple Trait Colocalization from Summary Association Statistics , 2017, bioRxiv.

[9]  Pau Farré,et al.  Additional annotation enhances potential for biologically-relevant analysis of the Illumina Infinium HumanMethylation450 BeadChip array , 2013, Epigenetics & Chromatin.

[10]  Ayellet V. Segrè,et al.  Colocalization of GWAS and eQTL Signals Detects Target Genes , 2016, bioRxiv.

[11]  Jonathan K. Pritchard,et al.  The Genetic and Mechanistic Basis for Variation in Gene Regulation , 2015, PLoS genetics.

[12]  M. Sobel Direct and Indirect Effects in Linear Structural Equation Models , 1987 .

[13]  J. Baron,et al.  Genome-Wide Association Study Identifies Chromosome 10q24.32 Variants Associated with Arsenic Metabolism and Toxicity Phenotypes in Bangladesh , 2012, PLoS genetics.

[14]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[15]  J. Baron,et al.  Baseline comorbidities in a skin cancer prevention trial in Bangladesh , 2013, European journal of clinical investigation.

[16]  X. Wen,et al.  Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization , 2016, bioRxiv.

[17]  T. Spector,et al.  The Presence of Methylation Quantitative Trait Loci Indicates a Direct Genetic Influence on the Level of DNA Methylation in Adipose Tissue , 2013, PloS one.

[18]  J. Baron,et al.  Arsenic metabolism efficiency has a causal role in arsenic toxicity: Mendelian randomization and gene-environment interaction. , 2013, International journal of epidemiology.

[19]  Mathieu Blanchette,et al.  The relationship between DNA methylation, genetic and expression inter-individual variation in untransformed human fibroblasts , 2014, Genome Biology.

[20]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[21]  Peter Langfelder,et al.  Genetic analysis of DNA methylation and gene expression levels in whole blood of healthy human subjects , 2012, BMC Genomics.

[22]  Mary D. Fortune,et al.  Integration of disease association and eQTL data using a Bayesian colocalisation approach highlights six candidate causal genes in immune-mediated diseases , 2015, Human molecular genetics.

[23]  Jonathan K. Pritchard,et al.  Identification of Genetic Variants That Affect Histone Modifications in Human Cells , 2013, Science.

[24]  Lorenzo Richiardi,et al.  Mediation analysis in epidemiology: methods, interpretation and bias. , 2013, International journal of epidemiology.

[25]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[26]  David A. Knowles,et al.  RNA splicing is a primary link between genetic variation and disease , 2016, Science.

[27]  Stefan Enroth,et al.  Identification of genetic variants influencing the human plasma proteome , 2013, Proceedings of the National Academy of Sciences.

[28]  Magda Tsolaki,et al.  Identification of cis-regulatory variation influencing protein abundance levels in human plasma. , 2012, Human molecular genetics.

[29]  Joseph K. Pickrell,et al.  DNaseI sensitivity QTLs are a major determinant of human expression variation , 2011, Nature.

[30]  Alexis Battle,et al.  Impact of regulatory variation from RNA to protein , 2015, Science.

[31]  E. Dermitzakis,et al.  Passive and active DNA methylation and the interplay with genetic variation in gene regulation , 2013, eLife.

[32]  Lude Franke,et al.  Mediation Analysis Demonstrates That Trans-eQTLs Are Often Explained by Cis-Mediation: A Genome-Wide Analysis among 1,800 South Asians , 2014, PLoS genetics.

[33]  Roderic Guigó,et al.  Identification of genetic variants associated with alternative splicing using sQTLseekeR , 2014, Nature Communications.

[34]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[35]  Jiang Qian,et al.  Transcription factors as readers and effectors of DNA methylation , 2016, Nature Reviews Genetics.

[36]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[37]  Joseph K. Pickrell,et al.  Detection and interpretation of shared genetic influences on 42 human traits , 2015, Nature Genetics.

[38]  Luigi Ferrucci,et al.  GeMes, clusters of DNA methylation under genetic control, can inform genetic and epigenetic analysis of disease. , 2014, American journal of human genetics.

[39]  John D. Blischak,et al.  Methylation QTLs Are Associated with Coordinated Changes in Transcription Factor Binding, Histone Modifications, and Gene Expression Levels , 2014, bioRxiv.

[40]  Mathieu Lemire,et al.  Long-range epigenetic regulation is conferred by genetic variation located at thousands of independent loci , 2015, Nature Communications.

[41]  T. Hudson,et al.  Genome-wide search for exonic variants affecting translational efficiency , 2013, Nature Communications.