Comprehensive evaluation of disease- and trait-specific enrichment for eight functional elements among GWAS-identified variants

Genome-wide association study (GWAS)-identified variants are enriched for functional elements. However, we have limited knowledge of how functional enrichment may differ by disease/trait and tissue type. We tested a broad set of eight functional elements for enrichment among GWAS-identified SNPs (p < 5×10−8) from the NHGRI-EBI Catalog across seven disease/trait categories: cancer, cardiovascular disease, diabetes, autoimmune disease, psychiatric disease, neurological disease, and anthropometric traits. SNPs were annotated using HaploReg for the eight functional elements across any tissue: DNase sites, expression quantitative trait loci (eQTL), sequence conservation, enhancers, promoters, missense variants, sequence motifs, and protein binding sites. In addition, tissue-specific annotations were considered for brain vs. blood. Disease/trait SNPs were compared to a control set of 4809 SNPs matched to the GWAS SNPs (N = 1639) on allele frequency, gene density, distance to nearest gene, and linkage disequilibrium at ~3:1 ratio. Enrichment analyses were conducted using logistic regression, with Bonferroni correction. Overall, a significant enrichment was observed for all functional elements, except sequence motifs. Missense SNPs showed the strongest magnitude of enrichment. eQTLs were the only functional element significantly enriched across all diseases/traits. Magnitudes of enrichment were generally similar across diseases/traits, where enrichment was statistically significant. Blood vs. brain tissue effects on enrichment were dependent on disease/trait and functional element (e.g., cardiovascular disease: eQTLs PTissueDifference = 1.28 × 10−6 vs. enhancers PTissueDifference = 0.94). Identifying disease/trait-relevant functional elements and tissue types could provide new insight into the underlying biology, by guiding a priori GWAS analyses (e.g., brain enhancer elements for psychiatric disease) or facilitating post hoc interpretation.

[1]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[2]  Ayellet V. Segrè,et al.  Hundreds of variants clustered in genomic loci and biological pathways affect human height , 2010, Nature.

[3]  Luigi Ferrucci,et al.  Abundant Quantitative Trait Loci Exist for DNA Methylation and Gene Expression in Human Brain , 2010, PLoS genetics.

[4]  Joseph K. Pickrell Joint analysis of functional genomic data and genome-wide association studies of 18 human traits , 2013, bioRxiv.

[5]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[6]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[7]  L. Bierut,et al.  Cis-Expression Quantitative Trait Loci Mapping Reveals Replicable Associations with Heroin Addiction in OPRM1 , 2015, Biological Psychiatry.

[8]  Chris S Haley,et al.  The genomic signature of trait-associated variants , 2013, BMC Genomics.

[9]  T. Behrens,et al.  Using Gene Expression to Improve the Power of Genome-Wide Association Analysis , 2014, Human Heredity.

[10]  Eleazar Eskin,et al.  Incorporating prior information into association studies , 2012, Bioinform..

[11]  Buhm Han,et al.  Chromatin marks identify critical cell types for fine mapping complex trait variants , 2012 .

[12]  M C O'Donovan,et al.  Schizophrenia susceptibility alleles are enriched for alleles that affect gene expression in adult human brain , 2012, Molecular Psychiatry.

[13]  Gerome Breen,et al.  Using Functional Annotation for the Empirical Determination of Bayes Factors for Genome-Wide Association Study Analysis , 2011, PloS one.

[14]  N. Cox,et al.  Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS , 2010, PLoS genetics.

[15]  P. Deloukas,et al.  Common Regulatory Variation Impacts Gene Expression in a Cell Type–Dependent Manner , 2009, Science.

[16]  A. Singleton,et al.  Genetic variability in the regulation of gene expression in ten regions of the human brain , 2014, Nature Neuroscience.

[17]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[18]  Timothy J. Durham,et al.  "Systematic" , 1966, Comput. J..

[19]  Joel Hirschhorn,et al.  SNPsnap: a Web-based tool for identification and annotation of matched SNPs , 2015, Bioinform..

[20]  Manolis Kellis,et al.  HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants , 2011, Nucleic Acids Res..

[21]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[22]  Jiang Gui,et al.  Weighted multiple testing procedures for genomic studies , 2012, BioData Mining.

[23]  Edgar Erdfelder,et al.  G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences , 2007, Behavior research methods.

[24]  T. Furey,et al.  Integrating genetic and gene expression evidence into genome-wide association analysis of gene sets. , 2011, Genome research.

[25]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[26]  Mark I. McCarthy,et al.  Pancreatic islet enhancer clusters enriched in type 2 diabetes risk–associated variants , 2013, Nature Genetics.

[27]  Martin Renqiang Min,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[28]  Yoav Gilad,et al.  Expression quantitative trait loci detected in cell lines are often present in primary tissues. , 2009, Human molecular genetics.

[29]  Hongyu Zhao,et al.  GPA: A Statistical Approach to Prioritizing GWAS Results by Integrating Pleiotropy and Annotation , 2014, PLoS genetics.

[30]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[31]  E. Dermitzakis,et al.  Candidate Causal Regulatory Effects by Integration of Expression QTLs with Complex Trait Genetic Associations , 2010, PLoS genetics.

[32]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[33]  E. Gamazon,et al.  Enrichment of Cis-Regulatory Gene Expression SNPs and Methylation Quantitative Trait Loci Among Bipolar Disorder Susceptibility Variants , 2012, Molecular Psychiatry.

[34]  L. Liang,et al.  Using eQTL weights to improve power for genome-wide association studies: a genetic study of childhood asthma , 2013, Front. Genet..

[35]  L. Wasserman,et al.  Improving power in genome‐wide association studies: weights tip the scale , 2007, Genetic epidemiology.

[36]  Derek W Wright,et al.  Gateways to the FANTOM5 promoter level mammalian expression atlas , 2015, Genome Biology.

[37]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[38]  David J. Nott,et al.  Intra- and inter-individual genetic differences in gene expression , 2008, Mammalian Genome.