Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits

The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.The identification of the causal gene at a GWAS locus remains to be a challenging task. Here, using the SMR & HEIDI method to integrate GWAS, eQTL and mQTL data, Wu et al. map DNA methylation sites to the transcriptome and thereby prioritize functionally relevant genes for 12 human complex traits.

[1]  Jiang Qian,et al.  Transcription factors as readers and effectors of DNA methylation , 2016, Nature Reviews Genetics.

[2]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[3]  Yang Song,et al.  Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery , 2011, Nucleic Acids Res..

[4]  Jun S. Liu,et al.  Genetics of rheumatoid arthritis contributes to biology and drug discovery , 2013 .

[5]  Joseph E. Powell,et al.  The Brisbane Systems Genetics Study: Genetical Genomics Meets Complex Trait Genetics , 2012, PloS one.

[6]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[7]  M. Orlovic PMH30 UNDERSTANDING TRENDS IN DEMENTIA AND COGNITIVE IMPAIRMENT IN THE UNITED STATES: EVIDENCE FROM THE HEALTH AND RETIREMENT STUDY (HRS) , 2019, Value in Health.

[8]  A. Hofman,et al.  Identification of context-dependent expression quantitative trait loci in whole blood , 2016, Nature Genetics.

[9]  Łukasz M. Boryń,et al.  Genome-Wide Quantitative Enhancer Activity Maps Identified by STARR-seq , 2013, Science.

[10]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[11]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[12]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[13]  P. Visscher,et al.  Quantifying the mapping precision of genome-wide association studies using whole-genome sequencing data , 2017, Genome Biology.

[14]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[15]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[16]  Matthew E. Ritchie,et al.  A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data , 2009, Nucleic acids research.

[17]  F. Agakov,et al.  Abundant pleiotropy in human complex diseases and traits. , 2011, American journal of human genetics.

[18]  Joseph K. Pickrell,et al.  Detection and interpretation of shared genetic influences on 42 human traits , 2015, Nature Genetics.

[19]  Benjamin A. Logsdon,et al.  Gene Expression Elucidates Functional Impact of Polygenic Risk for Schizophrenia , 2016, Nature Neuroscience.

[20]  P. Visscher,et al.  Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index , 2015, Nature Genetics.

[21]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[22]  Luke R. Lloyd-Jones,et al.  The Genetic Architecture of Gene Expression in Peripheral Blood. , 2017, American journal of human genetics.

[23]  A. Folsom,et al.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. , 1989, American journal of epidemiology.

[24]  A. Dunning,et al.  Beyond GWASs: illuminating the dark road from association to function. , 2013, American journal of human genetics.

[25]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[26]  Judy H. Cho,et al.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations , 2015, Nature Genetics.

[27]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[28]  D. Kuonen Saddlepoint approximations for distributions of quadratic forms in normal variables , 1999 .

[29]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[30]  Tanya M. Teslovich,et al.  Discovery and refinement of loci associated with lipid levels , 2013, Nature Genetics.

[31]  Pau Farré,et al.  Additional annotation enhances potential for biologically-relevant analysis of the Illumina Infinium HumanMethylation450 BeadChip array , 2013, Epigenetics & Chromatin.

[32]  M. Daly,et al.  An Atlas of Genetic Correlations across Human Diseases and Traits , 2015, Nature Genetics.

[33]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[34]  Judith B. Zaugg,et al.  Genetic Control of Chromatin States in Humans Involves Local and Distal Chromosomal Interactions , 2015, Cell.

[35]  D. Balding,et al.  Epigenome-wide association studies for common human diseases , 2011, Nature Reviews Genetics.

[36]  Tom R. Gaunt,et al.  Epigenome-wide association of DNA methylation markers in peripheral blood from Indian Asians and Europeans with incident type 2 diabetes: a nested case-control study. , 2015, The lancet. Diabetes & endocrinology.

[37]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[38]  A. Stark,et al.  Transcriptional enhancers: from properties to genome-wide predictions , 2014, Nature Reviews Genetics.

[39]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[40]  M. Levine,et al.  DNA methylation-based measures of biological age: meta-analysis predicting time to death , 2016, Aging.

[41]  J. DeVoss,et al.  A Crohn’s disease variant in Atg16l1 enhances its degradation by caspase 3 , 2014, Nature.

[42]  Martin J. Aryee,et al.  Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in Rheumatoid Arthritis , 2013, Nature Biotechnology.

[43]  A. Singleton,et al.  Genetic variability in the regulation of gene expression in ten regions of the human brain , 2014, Nature Neuroscience.

[44]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[45]  R. Elkon,et al.  Functional genetic screens for enhancer elements in the human genome using CRISPR-Cas9 , 2016, Nature Biotechnology.

[46]  Peter M. Visscher,et al.  Fast set-based association analysis using summary data from GWAS identifies novel gene loci for human complex traits , 2016, Scientific Reports.

[47]  David A. Knowles,et al.  RNA splicing is a primary link between genetic variation and disease , 2016, Science.

[48]  Jonathan P. Beauchamp,et al.  Genome-wide association study identifies 74 loci associated with educational attainment , 2016, Nature.

[49]  Manolis Kellis,et al.  FTO Obesity Variant Circuitry and Adipocyte Browning in Humans. , 2015, The New England journal of medicine.

[50]  Tanya M. Teslovich,et al.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes , 2012, Nature Genetics.

[51]  Tom R. Gaunt,et al.  Epigenome-wide association study of body mass index, and the adverse outcomes of adiposity , 2016, Nature.

[52]  Bok-Ghee Han,et al.  Ubiquitous Polygenicity of Human Complex Traits: Genome-Wide Analysis of 49 Traits in Koreans , 2013, PLoS genetics.

[53]  A. Simopoulos Genetic variants in the metabolism of omega-6 and omega-3 fatty acids: their role in the determination of nutritional requirements and chronic disease risk , 2010, Experimental biology and medicine.

[54]  Amanda Sonnega,et al.  Cohort Profile: the Health and Retirement Study (HRS). , 2014, International journal of epidemiology.

[55]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[56]  J. Danesh,et al.  A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease , 2016 .

[57]  Alexander Gusev,et al.  Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights , 2016, Nature Genetics.

[58]  Dylan S. Small,et al.  A review of instrumental variable estimators for Mendelian randomization , 2015, Statistical methods in medical research.

[59]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[60]  K. Pollard,et al.  Enhancer–promoter interactions are encoded by complex genomic signatures on looping chromatin , 2016, Nature Genetics.

[61]  Olle Melander,et al.  From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus , 2010, Nature.

[62]  D. Schübeler Function and information content of DNA methylation , 2015, Nature.

[63]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[64]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[65]  喬 嶋本 Nested Case-control Study , 1992, Definitions.

[66]  M. O’Donovan,et al.  Pleiotropic effects of trait-associated genetic variation on DNA methylation: utility for refining , 2018 .

[67]  M. Daly,et al.  Genetic and Epigenetic Fine-Mapping of Causal Autoimmune Disease Variants , 2014, Nature.

[68]  Joel Eriksson,et al.  FTO genotype is associated with phenotypic variability of body mass index , 2012, Nature.

[69]  N. Cox,et al.  Obesity-associated variants within FTO form long-range functional connections with IRX3 , 2014, Nature.

[70]  Naomi R. Wray,et al.  Identification of 55,000 Replicated DNA Methylation QTL , 2017 .

[71]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[72]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[73]  N. Samani,et al.  Coronary Artery Disease–Associated LIPA Coding Variant rs1051338 Reduces Lysosomal Acid Lipase Levels and Activity in Lysosomes , 2017, Arteriosclerosis, thrombosis, and vascular biology.

[74]  Tom R. Gaunt,et al.  The UK10K project identifies rare variants in health and disease , 2016 .

[75]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[76]  Tamara S. Roman,et al.  New genetic loci link adipose and insulin biology to body fat distribution , 2014, Nature.

[77]  Eric S. Lander,et al.  Direct Identification of Hundreds of Expression-Modulating Variants using a Multiplexed Reporter Assay , 2016, Cell.

[78]  George Davey Smith,et al.  Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology , 2008, Statistics in medicine.

[79]  R. Marioni,et al.  Identification of 55,000 Replicated DNA Methylation QTL , 2017, Scientific Reports.