Quantifying genetic effects on disease mediated by assayed gene expression levels

Disease variants identified by genome-wide association studies (GWAS) tend to overlap with expression quantitative trait loci (eQTLs). However, it remains unclear whether this overlap is driven by mediation of genetic effects on disease by expression levels, or whether it primarily reflects pleiotropic relationships instead. Here we introduce a new method, mediated expression score regression (MESC), to estimate disease heritability mediated by the cis-genetic component of assayed steady-state gene expression levels, using summary association statistics from GWAS and eQTL studies. We show that MESC produces robust estimates of expression-mediated heritability across a wide range of simulations. We applied MESC to GWAS summary statistics for 42 diseases and complex traits (average N = 323K) and cis-eQTL data across 48 tissues from the GTEx consortium. We determined that a statistically significant but low proportion of disease heritability (mean estimate 11% with S.E. 2%) is mediated by the cis-genetic component of assayed gene expression levels, with substantial variation across diseases (point estimates from 0% to 38%). We further partitioned expression-mediated heritability across various gene sets. We observed an inverse relationship between cis-heritability of expression and disease heritability mediated by expression, suggesting that genes with weaker eQTLs have larger causal effects on disease. Moreover, we observed broad patterns of expression-mediated heritability enrichment across functional gene sets that implicate specific gene sets in disease, including loss-of-function intolerant genes and FDA-approved drug targets. Our results demonstrate that steady-state expression levels in bulk tissues are informative of regulatory disease mechanisms, while also motivating additional assays to more fully capture the regulatory effects of GWAS variants.

[1]  Ian Jackson,et al.  Variants of the melanocyte–stimulating hormone receptor gene are associated with red hair and fair skin in humans , 1995, Nature Genetics.

[2]  Colm O'Dushlaine,et al.  INRICH: interval-based enrichment analysis for genome-wide association studies , 2012, Bioinform..

[3]  Po-Ru Loh,et al.  Estimating the proportion of disease heritability mediated by gene expression levels , 2017, bioRxiv.

[4]  Todd L Edwards,et al.  Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics , 2018, Nature Communications.

[5]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[6]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[7]  M. Daly,et al.  Genetic and Epigenetic Fine-Mapping of Causal Autoimmune Disease Variants , 2014, Nature.

[8]  David A. Knowles,et al.  RNA splicing is a primary link between genetic variation and disease , 2016, Science.

[9]  M. Bucan,et al.  From Mouse to Human: Evolutionary Genomics Analysis of Human Orthologs of Essential Genes , 2013, PLoS genetics.

[10]  P. Donnelly,et al.  The UK Biobank resource with deep phenotyping and genomic data , 2018, Nature.

[11]  J. Hirschhorn,et al.  Biological interpretation of genome-wide association studies using predicted gene functions , 2015, Nature Communications.

[12]  C. Bustamante,et al.  Melanesian Blond Hair Is Caused by an Amino Acid Change in TYRP1 , 2012, Science.

[13]  Valeriia Haberland,et al.  The MR-Base platform supports systematic causal inference across the human phenome , 2018, eLife.

[14]  Jakob Grove,et al.  Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection , 2018, Nature Genetics.

[15]  P. Sachs,et al.  SMARCAD1 ATPase activity is required to silence endogenous retroviruses in embryonic stem cells , 2019, Nature Communications.

[16]  N. Patterson,et al.  Polygenicity of complex traits is explained by negative selection , 2018, bioRxiv.

[17]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[18]  T. Heskes,et al.  The statistical properties of gene-set analysis , 2016, Nature Reviews Genetics.

[19]  A. Zwinderman,et al.  Multi-tissue transcriptome analyses identify genetic mechanisms underlying neuropsychiatric traits , 2019, Nature Genetics.

[20]  Jeffery M. Meyer,et al.  A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer , 2018, Nature Genetics.

[21]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[22]  Hongyu Zhao,et al.  A statistical framework for cross-tissue transcriptome-wide association analysis , 2018, Nature Genetics.

[23]  Luke R. Lloyd-Jones,et al.  Signatures of negative selection in the genetic architecture of human complex traits , 2018, Nature Genetics.

[24]  B. Berger,et al.  Efficient Bayesian mixed model analysis increases association power in large cohorts , 2014, Nature Genetics.

[25]  Nikolaos A Patsopoulos,et al.  Limited statistical evidence for shared genetic effects of eQTLs and autoimmune disease-associated loci in three major immune cell types , 2017, Nature Genetics.

[26]  J. Pritchard,et al.  Evidence for Weak Selective Constraint on Human Gene Expression , 2018, Genetics.

[27]  O. Delaneau,et al.  Estimating the causal tissues for complex traits and diseases , 2016, Nature Genetics.

[28]  David E Hill,et al.  Dynamic Role of trans Regulation of Gene Expression in Relation to Complex Traits. , 2017, American journal of human genetics.

[29]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[30]  Po-Ru Loh,et al.  Mixed-model association for biobank-scale datasets , 2018, Nature Genetics.

[31]  Matthew Stephens,et al.  Large-scale genome-wide enrichment analyses identify new trait-associated genes and pathways across 31 human phenotypes , 2018, Nature Communications.

[32]  Sangsoo Kim,et al.  Efficient pathway enrichment and network analysis of GWAS summary data using GSA-SNP2 , 2018, Nucleic acids research.

[33]  Ayellet V. Segrè,et al.  Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation , 2018, Nature Genetics.

[34]  Annie W Shieh,et al.  Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder , 2018, Science.

[35]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[36]  Ayellet V. Segrè,et al.  Colocalization of GWAS and eQTL Signals Detects Target Genes , 2016, bioRxiv.

[37]  Warren W. Kretzschmar,et al.  Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression , 2017, Nature Genetics.

[38]  Snæbjörn Pálsson,et al.  Genetic determinants of hair, eye and skin pigmentation in Europeans , 2007, Nature Genetics.

[39]  Snæbjörn Pálsson,et al.  Two newly identified genetic determinants of pigmentation in Europeans , 2008, Nature Genetics.

[40]  Steven J. M. Jones,et al.  The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery , 2016, Cell.

[41]  Benjamin A. Logsdon,et al.  Gene expression imputation across multiple brain regions provides insights into schizophrenia risk , 2019, Nature Genetics.

[42]  Peter Kraft,et al.  Transcriptome‐wide association studies accounting for colocalization using Egger regression , 2018, Genetic epidemiology.

[43]  Jake Siegel,et al.  Genetics of trans-regulatory variation in gene expression , 2017, bioRxiv.

[44]  F. Hormozdiari,et al.  Genes with High Network Connectivity Are Enriched for Disease Heritability. , 2019, American journal of human genetics.

[45]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[46]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[47]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[48]  Benjamin J. Strober,et al.  Dynamic genetic regulation of gene expression during cellular differentiation , 2019, Science.

[49]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[50]  L. Kruglyak,et al.  The role of regulatory variation in complex traits and disease , 2015, Nature Reviews Genetics.

[51]  Thawfeek M. Varusai,et al.  The Reactome Pathway Knowledgebase , 2017, Nucleic acids research.

[52]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[53]  Keith Lawson,et al.  Evaluation and Design of Genome-wide CRISPR/Cas9 Knockout Screens , 2017, bioRxiv.

[54]  D. Licatalosi,et al.  FMRP Stalls Ribosomal Translocation on mRNAs Linked to Synaptic Function and Autism , 2011, Cell.

[55]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[56]  David P. Nusinow,et al.  Estimating the Selective Effects of Heterozygous Protein Truncating Variants from Human Exome Data , 2017, Nature Genetics.

[57]  G. Robinson That BLUP is a Good Thing: The Estimation of Random Effects , 1991 .

[58]  G. Davey Smith,et al.  Evaluating the potential role of pleiotropy in Mendelian randomization studies , 2018, Human molecular genetics.

[59]  M. Owen,et al.  Expression quantitative trait loci in the developing human brain and their enrichment in neuropsychiatric disorders , 2018, Genome Biology.

[60]  D. Goldstein,et al.  Enhancer redundancy predicts gene pathogenicity and informs complex disease gene discovery , 2018, bioRxiv.

[61]  Alexander Gusev,et al.  A transcriptome-wide association study of high grade serous epithelial ovarian cancer identifies novel susceptibility genes and splice variants , 2019, Nature Genetics.

[62]  D. Durocher,et al.  Evaluation and Design of Genome-Wide CRISPR/SpCas9 Knockout Screens , 2017, G3: Genes, Genomes, Genetics.

[63]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[64]  C. Hartl,et al.  Genetic control of gene expression and splicing in the developing human brain , 2018 .

[65]  Adan Valladares-Salgado,et al.  Cross-tissue and tissue-specific eQTLs: partitioning the heritability of a complex trait. , 2014, American journal of human genetics.

[66]  Alexander Gusev,et al.  Functional Architectures of Local and Distal Regulation of Gene Expression in Multiple Human Tissues. , 2017, American journal of human genetics.

[67]  Benjamin D. Greenberg,et al.  Partitioning the Heritability of Tourette Syndrome and Obsessive Compulsive Disorder Reveals Differences in Genetic Architecture , 2013, PLoS genetics.

[68]  Stephan J Sanders,et al.  A framework for the interpretation of de novo mutation in human disease , 2014, Nature Genetics.

[69]  Joris M. Mooij,et al.  MAGMA: Generalized Gene-Set Analysis of GWAS Data , 2015, PLoS Comput. Biol..

[70]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[71]  P. Visscher,et al.  10 Years of GWAS Discovery: Biology, Function, and Translation. , 2017, American journal of human genetics.

[72]  C. Hartl,et al.  Genetic Control of Expression and Splicing in Developing Human Brain Informs Disease Mechanisms , 2019, Cell.

[73]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[74]  Zoltán Kutalik,et al.  Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits , 2019, Nature Communications.

[75]  E. Lander,et al.  Identification and characterization of essential genes in the human genome , 2015, Science.

[76]  A. Gusev,et al.  Probabilistic fine-mapping of transcriptome-wide association studies , 2017, Nature Genetics.

[77]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[78]  Michael R. Johnson,et al.  Re-evaluation of SNP heritability in complex human traits , 2016, Nature Genetics.

[79]  Yakir A Reshef,et al.  Partitioning heritability by functional annotation using genome-wide association summary statistics , 2015, Nature Genetics.

[80]  R. Durbin,et al.  Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses , 2012, Nature Protocols.

[81]  Michael Wainberg,et al.  Opportunities and challenges for transcriptome-wide association studies , 2019, Nature Genetics.

[82]  M. G. van der Wijst,et al.  Single-cell RNA sequencing identifies cell type-specific cis-eQTLs and co-expression QTLs , 2018, Nature Genetics.

[83]  B. Neale,et al.  Linkage disequilibrium dependent architecture of human complex traits reveals action of negative selection , 2016, bioRxiv.

[84]  Helga Thorvaldsdóttir,et al.  Molecular signatures database (MSigDB) 3.0 , 2011, Bioinform..

[85]  A. Halim,et al.  Mechanism Action of Platelets and Crucial Blood Coagulation Pathways in Hemostasis , 2017, International journal of hematology-oncology and stem cell research.

[86]  Alkes L. Price,et al.  Quantification of frequency-dependent genetic architectures in 25 UK Biobank traits reveals action of negative selection , 2019, Nature Communications.

[87]  E. Boerwinkle,et al.  dbNSFP v2.0: A Database of Human Non‐synonymous SNVs and Their Functional Predictions and Annotations , 2013, Human mutation.

[88]  Noah Zaitlen,et al.  Ultra-rare variants drive substantial cis-heritability of human gene expression , 2019, Nature Genetics.

[89]  Alkes L Price,et al.  Reconciling S-LDSC and LDAK functional enrichment estimates , 2019, Nature Genetics.

[90]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[91]  Judith A. Blake,et al.  The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics , 2010, Nucleic Acids Res..

[92]  Helen E. Parkinson,et al.  The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog) , 2016, Nucleic Acids Res..

[93]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[94]  C. Spearman The proof and measurement of association between two things. By C. Spearman, 1904. , 1987, The American journal of psychology.

[95]  Alkes L. Price,et al.  Single-Tissue and Cross-Tissue Heritability of Gene Expression Via Identity-by-Descent in Related or Unrelated Individuals , 2011, PLoS genetics.

[96]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[97]  S. Kumar,et al.  Chromatin three-dimensional interactions mediate genetic effects on gene expression , 2019, Science.

[98]  O. MacDougald,et al.  Regulation of bone mass by Wnt signaling. , 2006, The Journal of clinical investigation.

[99]  Marcus Krüger,et al.  Genetic compensation induced by deleterious mutations but not gene knockdowns , 2015, Nature.

[100]  Evan Z. Macosko,et al.  Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types , 2017, Nature Genetics.

[101]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[102]  Ayellet V. Segrè,et al.  Common Inherited Variation in Mitochondrial Genes Is Not Enriched for Associations with Type 2 Diabetes or Related Glycemic Traits , 2010, PLoS genetics.

[103]  Yakir A Reshef,et al.  Leveraging molecular quantitative trait loci to understand the genetic architecture of diseases and complex traits , 2018, Nature Genetics.

[104]  Michael J. Gloudemans,et al.  Abundant associations with gene expression complicate GWAS follow-up , 2019, Nature Genetics.

[105]  Alexander Gusev,et al.  Large-scale transcriptome-wide association study identifies new prostate cancer risk regions , 2018, Nature Communications.

[106]  N. Patterson,et al.  Extreme Polygenicity of Complex Traits Is Explained by Negative Selection. , 2019, American journal of human genetics.

[107]  Sina A. Gharib,et al.  Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis , 2018, bioRxiv.