Leveraging brain cortex-derived molecular data to elucidate epigenetic and transcriptomic drivers of complex traits and disease

Integrative approaches that harness large-scale molecular datasets can help develop mechanistic insight into findings from genome-wide association studies (GWAS). We have performed extensive analyses to uncover transcriptional and epigenetic processes which may play a role in complex trait variation. This was undertaken by applying Bayesian multiple-trait colocalization systematically across the genome to identify genetic variants responsible for influencing intermediate molecular phenotypes as well as complex traits. In this analysis, we leveraged high-dimensional quantitative trait loci data derived from the prefrontal cortex tissue (concerning gene expression, DNA methylation and histone acetylation) and GWAS findings for five complex traits (Neuroticism, Schizophrenia, Educational Attainment, Insomnia and Alzheimer’s disease). There was evidence of colocalization for 118 associations, suggesting that the same underlying genetic variant influenced both nearby gene expression as well as complex trait variation. Of these, 73 associations provided evidence that the genetic variant also influenced proximal DNA methylation and/or histone acetylation. These findings support previous evidence at loci where epigenetic mechanisms may putatively mediate effects of genetic variants on traits, such as KLC1 and schizophrenia. We also uncovered evidence implicating novel loci in disease susceptibility, including genes expressed predominantly in the brain tissue, such as MDGA1, KIRREL3 and SLC12A5. An inverse relationship between DNA methylation and gene expression was observed more than can be accounted for by chance, supporting previous findings implicating DNA methylation as a transcriptional repressor. Our study should prove valuable in helping future studies prioritize candidate genes and epigenetic mechanisms for in-depth functional follow-up analyses.

[1]  Jonathan P. Beauchamp,et al.  Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals , 2018, Nature Genetics.

[2]  Jonathan P. Beauchamp,et al.  Genome-wide association study identifies 74 loci associated with educational attainment , 2016, Nature.

[3]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[4]  P. Donnelly,et al.  Genome-wide genetic data on ~500,000 UK Biobank participants , 2017, bioRxiv.

[5]  E. Pennisi Genomics. Inching toward the 3D genome. , 2015, Science.

[6]  McKenzie L. Shaw,et al.  DNA methylation of TOMM40-APOE-APOC2 in Alzheimer’s disease , 2018, Journal of Human Genetics.

[7]  Alessandro Bertolino,et al.  Translating genome-wide association findings into new therapeutics for psychiatry , 2016, Nature Neuroscience.

[8]  S. Hansen,et al.  Personality traits in bipolar disorder and influence on outcome , 2017, BMC Psychiatry.

[9]  A. Dunning,et al.  Beyond GWASs: illuminating the dark road from association to function. , 2013, American journal of human genetics.

[10]  I. Scheffer,et al.  Mutations in SLC12A5 in epilepsy of infancy with migrating focal seizures , 2015, Nature Communications.

[11]  Jing Tang,et al.  Identification of the gene signature reflecting schizophrenia’s etiology by constructing artificial intelligence‐based method of enhanced reproducibility , 2019, CNS neuroscience & therapeutics.

[12]  A. Quinlan BEDTools: The Swiss‐Army Tool for Genome Feature Analysis , 2014, Current protocols in bioinformatics.

[13]  Erdogan Taskesen,et al.  Functional mapping and annotation of genetic associations with FUMA , 2017, Nature Communications.

[14]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[15]  Wen Zhang,et al.  A Bayesian Framework for Multiple Trait Colocalization from Summary Association Statistics , 2017, bioRxiv.

[16]  Ellis Patrick,et al.  An xQTL map integrates the genetic architecture of the human brain’s transcriptome and epigenome , 2017, Nature Neuroscience.

[17]  V. Vacic,et al.  Genome-wide Analysis of Insomnia (N=1,331,010) Identifies Novel Loci and Functional Pathways , 2018, bioRxiv.

[18]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[19]  V. Golimbet,et al.  Profiling haplotype specific CpG and CpH methylation within a schizophrenia GWAS locus on chromosome 14 in schizophrenia and healthy subjects , 2020, Scientific Reports.

[20]  Gustavo Turecki,et al.  Methylomic profiling of human brain tissue supports a neurodevelopmental origin for schizophrenia , 2014, Genome Biology.

[21]  D. O'Leary,et al.  Radial Migration of Superficial Layer Cortical Neurons Controlled by Novel Ig Cell Adhesion Molecule MDGA1 , 2006, The Journal of Neuroscience.

[22]  Kathryn Demanelis,et al.  Co-occurring expression and methylation QTLs allow detection of common causal variants and shared biological mechanisms , 2018, Nature Communications.

[23]  Alkes L. Price,et al.  Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types , 2017 .

[24]  Mulin Jun Li,et al.  Nature Genetics Advance Online Publication a N a Ly S I S the Support of Human Genetic Evidence for Approved Drug Indications , 2022 .

[25]  Jon Wakefield,et al.  Bayes factors for genome‐wide association studies: comparison with P‐values , 2009, Genetic epidemiology.

[26]  S. Linnarsson,et al.  Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways , 2018, Nature Genetics.

[27]  Ayellet V. Segrè,et al.  Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation , 2018, Nature Genetics.

[28]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[29]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[30]  Ayellet V. Segrè,et al.  Colocalization of GWAS and eQTL Signals Detects Target Genes , 2016, bioRxiv.

[31]  Astrid Gall,et al.  Ensembl 2018 , 2017, Nucleic Acids Res..

[32]  Jonathan K. Pritchard,et al.  The Genetic and Mechanistic Basis for Variation in Gene Regulation , 2015, PLoS genetics.

[33]  Vladimir S Fonov,et al.  Network connectivity determines cortical thinning in early Parkinson’s disease progression , 2017, Nature Communications.

[34]  Ralf Herwig,et al.  ConsensusPathDB—a database for integrating human functional interaction networks , 2008, Nucleic Acids Res..

[35]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[36]  Evan Z. Macosko,et al.  Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types , 2017, Nature Genetics.

[37]  Stephen D. Turner,et al.  qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots , 2014, bioRxiv.

[38]  R. Marioni,et al.  GWAS on family history of Alzheimer’s disease , 2018, bioRxiv.

[39]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[40]  Tom R. Gaunt,et al.  Characterizing the Causal Pathway for Genetic Variants Associated with Neurological Phenotypes Using Human Brain-Derived Proteome Data , 2020, American journal of human genetics.

[41]  Sina A. Gharib,et al.  Identifying gene targets for brain-related traits using transcriptomic and methylomic data from blood , 2018, Nature Communications.

[42]  Pedro M. Valero-Mora,et al.  ggplot2: Elegant Graphics for Data Analysis , 2010 .

[43]  Alexander Gusev,et al.  Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights , 2016, Nature Genetics.

[44]  Tom R. Gaunt,et al.  Systematic Mendelian randomization framework elucidates hundreds of CpG sites which may mediate the influence of genetic variants on disease , 2018, Human molecular genetics.

[45]  E. Dermitzakis,et al.  Passive and active DNA methylation and the interplay with genetic variation in gene regulation , 2013, eLife.

[46]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[47]  Benjamin Neale,et al.  Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights , 2016 .

[48]  A. Wynshaw-Boris,et al.  Involvement of platelet‐activating factor and LIS1 in neuronal migration , 2003, The European journal of neuroscience.

[49]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[50]  Keegan D. Korthauer,et al.  Genome-wide repressive capacity of promoter DNA methylation is revealed through epigenomic manipulation , 2018, bioRxiv.

[51]  Ting Qi,et al.  Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits , 2018, Nature Communications.

[52]  Ellen M Wijsman,et al.  Comprehensive analysis of APOE and selected proximate markers for late-onset Alzheimer's disease: patterns of linkage disequilibrium and disease/marker association. , 2007, Genomics.

[53]  G. Fan,et al.  DNA Methylation and Its Basic Function , 2013, Neuropsychopharmacology.

[54]  M. O’Donovan,et al.  Pleiotropic effects of trait-associated genetic variation on DNA methylation: utility for refining , 2018 .

[55]  Daning Lu,et al.  Chromosome conformation elucidates regulatory relationships in developing human brain , 2016, Nature.

[56]  S. Muralidhar,et al.  The intellectual disability gene Kirrel3 regulates target-specific mossy fiber synapse development in the hippocampus , 2015, eLife.

[57]  E. Boerwinkle,et al.  Following-up genome-wide association study signals: lessons learned from Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study. , 2014, Circulation. Cardiovascular genetics.

[58]  P. Farnham,et al.  Frequent lack of repressive capacity of promoter DNA methylation identified through genome-wide epigenomic manipulation , 2017, bioRxiv.