Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types

Genetics can provide a systematic approach to discovering the tissues and cell types relevant for a complex disease or trait. Identifying these tissues and cell types is critical for following up on non-coding allelic function, developing ex-vivo models, and identifying therapeutic targets. Here, we analyze gene expression data from several sources, including the GTEx and PsychENCODE consortia, together with genome-wide association study (GWAS) summary statistics for 48 diseases and traits with an average sample size of 86,850, to identify disease-relevant tissues and cell types. We develop and apply an approach that uses stratified LD score regression to test whether disease heritability is enriched in regions surrounding genes with the highest specific expression in a given tissue. We detect tissue-specific enrichments at FDR < 5% for 30 diseases and traits across a broad range of tissues that recapitulate known biology. In our analysis of traits with observed central nervous system enrichment, we detect an enrichment of neurons over other brain cell types for several brain-related traits, enrichment of inhibitory neurons over excitatory neurons for bipolar disorder, and enrichments in the cortex for schizophrenia and in the striatum for migraine. In our analysis of traits with observed immunological enrichment, we identify enrichments of alpha beta T cells for asthma and eczema, B cells for primary biliary cirrhosis, and myeloid cells for lupus and Alzheimer’s disease. Our results demonstrate that our polygenic approach is a powerful way to leverage gene expression data for interpreting GWAS signal.

[1]  Nick C Fox,et al.  Analysis of shared heritability in common disorders of the brain , 2018, Science.

[2]  J. Biederman,et al.  Attention deficit/hyperactivity disorder across the lifespan. , 2002, Annual review of medicine.

[3]  Günter P. Wagner,et al.  Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples , 2012, Theory in Biosciences.

[4]  He Gao,et al.  Genome-wide association analysis identifies novel blood pressure loci and offers biological insights into cardiovascular risk , 2017, Nature Genetics.

[5]  Buhm Han,et al.  Chromatin marks identify critical cell types for fine mapping complex trait variants , 2012 .

[6]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[7]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[8]  Manolis Kellis,et al.  Joint Bayesian inference of risk variants and tissue-specific epigenomic enrichments across multiple complex human diseases , 2016, Nucleic acids research.

[9]  D. Rader,et al.  Inhibition of endothelial lipase causes increased HDL cholesterol levels in vivo. , 2003, The Journal of clinical investigation.

[10]  Xinli Hu,et al.  SNPsea: an algorithm to identify cell types, tissues and pathways affected by risk loci , 2014, Bioinform..

[11]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[12]  Y. Xing,et al.  A Transcriptome Database for Astrocytes, Neurons, and Oligodendrocytes: A New Resource for Understanding Brain Development and Function , 2008, The Journal of Neuroscience.

[13]  Kenneth R Feingold,et al.  The role of HDL in innate immunity1 , 2011, Journal of Lipid Research.

[14]  Casey S. Greene,et al.  International genome-wide meta-analysis identifies new primary biliary cirrhosis risk loci and targetable pathogenic pathways , 2015, Nature Communications.

[15]  C. Wijmenga,et al.  Gene expression analysis identifies global gene dosage sensitivity in cancer , 2015, Nature Genetics.

[16]  M. Heneka,et al.  Innate immunity in Alzheimer's disease , 2015, Nature Immunology.

[17]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[18]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[19]  N. Wray,et al.  Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance components analysis , 2015, Nature Genetics.

[20]  M. Edwards,et al.  Immune Dysfunction in Tourette Syndrome , 2013, Behavioural neurology.

[21]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[22]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[23]  Han Xu,et al.  Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. , 2014, American journal of human genetics.

[24]  Constantin Polychronakos,et al.  A Genome-Wide Meta-Analysis of Six Type 1 Diabetes Cohorts Identifies Multiple Associated Loci , 2011, PLoS genetics.

[25]  Jonathan P. Beauchamp,et al.  Genome-wide association study identifies 74 loci associated with educational attainment , 2016, Nature.

[26]  Tanya M. Teslovich,et al.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes , 2012, Nature Genetics.

[27]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[28]  Attention deficit hyperactivity disorder across the lifespan , 2017 .

[29]  D. Harrison,et al.  The immune system in hypertension. , 2014, Advances in physiology education.

[30]  C. Sabatti,et al.  Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.

[31]  Manolis Kellis,et al.  Conserved epigenomic signals in mice and humans reveal immune basis of Alzheimer’s disease , 2015, Nature.

[32]  E. Bullmore,et al.  Increased body mass index is associated with specific regional alterations in brain structure , 2016, International Journal of Obesity.

[33]  Roberto Lent,et al.  Isotropic Fractionator: A Simple, Rapid Method for the Quantification of Total Cell and Neuron Numbers in the Brain , 2005, The Journal of Neuroscience.

[34]  C. Enzinger,et al.  Burden of Risk Alleles for Hypertension Increases Risk of Intracerebral Hemorrhage , 2012, Stroke.

[35]  B. Bogerts,et al.  Acute schizophrenia is accompanied by reduced T cell and increased B cell immunity , 2010, European Archives of Psychiatry and Clinical Neuroscience.

[36]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[37]  N. Mehta Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. , 2011, Circulation. Cardiovascular genetics.

[38]  B. Neale,et al.  Linkage disequilibrium–dependent architecture of human complex traits shows action of negative selection , 2016, Nature Genetics.

[39]  Charity W. Law,et al.  voom: precision weights unlock linear model analysis tools for RNA-seq read counts , 2014, Genome Biology.

[40]  Giulio Genovese,et al.  Increased burden of ultra-rare protein-altering variants among 4,877 individuals with schizophrenia , 2016, Nature Neuroscience.

[41]  B. Neale,et al.  Linkage disequilibrium dependent architecture of human complex traits reveals action of negative selection , 2016, bioRxiv.

[42]  ● Pytorch,et al.  Attention! , 1998, Trends in Cognitive Sciences.

[43]  K. Tsuneyama,et al.  B cell depletion therapy exacerbates murine primary biliary cirrhosis , 2011, Hepatology.

[44]  M. Daly,et al.  Integrating Autoimmune Risk Loci with Gene-Expression Data Identifies Specific Pathogenic Immune Cell Subsets. , 2011, American journal of human genetics.

[45]  Iuliana Ionita-Laza,et al.  TISSUE-SPECIFIC FUNCTIONAL EFFECT PREDICTION OF GENETIC VARIATION AND APPLICATIONS TO COMPLEX TRAIT GENETICS , 2016 .

[46]  Qian Wang,et al.  Systematic tissue-specific functional annotation of the human genome highlights immune-related DNA elements for late-onset Alzheimer’s disease , 2016, bioRxiv.

[47]  J. Rioux,et al.  Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus , 2015, Nature Genetics.

[48]  B. Berger,et al.  Efficient Bayesian mixed model analysis increases association power in large cohorts , 2014, Nature Genetics.

[49]  Michael R. Johnson,et al.  Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies , 2014, The Lancet Neurology.

[50]  Genomics implicates adaptive and innate immunity in Alzheimer’s and Parkinson’s , 2016 .

[51]  Claude Bouchard,et al.  A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance , 2012, Nature Genetics.

[52]  L. Becerra,et al.  Migraine attacks the Basal Ganglia , 2011, Molecular pain.

[53]  Alexander Gusev,et al.  Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits. , 2017, American journal of human genetics.

[54]  J. Soares,et al.  The Immunology of Bipolar Disorder , 2014, Neuroimmunomodulation.

[55]  S. Hodgkinson,et al.  Immune dysregulation and autoimmunity in bipolar disorder: Synthesis of the evidence and its clinical application , 2013, The Australian and New Zealand journal of psychiatry.

[56]  Benjamin D. Greenberg,et al.  Partitioning the Heritability of Tourette Syndrome and Obsessive Compulsive Disorder Reveals Differences in Genetic Architecture , 2013, PLoS genetics.

[57]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[58]  M. Daly,et al.  Genetic and Epigenetic Fine-Mapping of Causal Autoimmune Disease Variants , 2014, Nature.

[59]  J. Mussini,et al.  [Immunology of multiple sclerosis]. , 1982, La semaine des hopitaux : organe fonde par l'Association d'enseignement medical des hopitaux de Paris.

[60]  Joris M. Mooij,et al.  MAGMA: Generalized Gene-Set Analysis of GWAS Data , 2015, PLoS Comput. Biol..

[61]  Lisa J. Martin,et al.  Meta-analysis of genome-wide association studies identifies 1q22 as a susceptibility locus for intracerebral hemorrhage. , 2014, American journal of human genetics.

[62]  Jun S. Liu,et al.  Genetics of rheumatoid arthritis contributes to biology and drug discovery , 2013 .

[63]  F. Benes,et al.  GABAergic Interneurons: Implications for Understanding Schizophrenia and Bipolar Disorder , 2001, Neuropsychopharmacology.

[64]  G. Hotamisligil,et al.  Inflammation and metabolic disorders , 2006, Nature.

[65]  C. Smith,et al.  γδ T cells promote inflammation and insulin resistance during high fat diet‐induced obesity in mice , 2015, Journal of leukocyte biology.

[66]  Michael J. Purcaro,et al.  The PsychENCODE project , 2015, Nature Neuroscience.

[67]  L. S. Lilly,et al.  Pathophysiology of Heart Disease: A Collaborative Project of Medical Students and Faculty , 2002 .

[68]  R. Coppola,et al.  Physiological dysfunction of the dorsolateral prefrontal cortex in schizophrenia revisited. , 2000, Cerebral cortex.

[69]  D. Koller,et al.  The Immunological Genome Project: networks of gene expression in immune cells , 2008, Nature Immunology.

[70]  N. Friedman,et al.  Perforin-Positive Dendritic Cells Exhibit an Immuno-regulatory Role in Metabolic Syndrome and Autoimmunity. , 2015, Immunity.

[71]  Joseph K. Pickrell Joint analysis of functional genomic data and genome-wide association studies of 18 human traits , 2013, bioRxiv.

[72]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[73]  Thomas Pap,et al.  Mechanisms of Disease: the molecular and cellular basis of joint destruction in rheumatoid arthritis , 2005, Nature Clinical Practice Rheumatology.

[74]  B. Pasaniuc,et al.  Contrasting the genetic architecture of 30 complex traits from summary association data , 2016, bioRxiv.

[75]  Tanya M. Teslovich,et al.  Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids , 2010, Nature.

[76]  D. Rader,et al.  Endothelial Lipase Promotes the Catabolism of ApoB-Containing Lipoproteins , 2004, Circulation research.

[77]  Jianxin Shi,et al.  Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs , 2013, Nature Genetics.

[78]  D. Boomsma,et al.  Stratified Linkage Disequilibrium Score Regression reveals enrichment of eQTL effects on complex traits is not tissue specific , 2017, bioRxiv.

[79]  O. Delaneau,et al.  Estimating the causal tissues for complex traits and diseases , 2016, Nature Genetics.

[80]  Daphne Koller,et al.  Polarization of the Effects of Autoimmune and Neurodegenerative Risk Alleles in Leukocytes , 2014, Science.

[81]  I. Mackay,et al.  AUTOIMMUNE, CHOLESTATIC AND BILIARY DISEASE Ongoing Activation of Autoantigen-Specific B Cells in Primary Biliary Cirrhosis , 2014 .

[82]  Sayan Mukherjee,et al.  Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia. , 2016, American journal of human genetics.

[83]  Howard Y. Chang,et al.  Lineage-specific and single cell chromatin accessibility charts human hematopoiesis and leukemia evolution , 2016, Nature Genetics.

[84]  Chuong B. Do,et al.  Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease , 2014, Nature Genetics.

[85]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[86]  H. Akiyama,et al.  Changes in density of calcium‐binding‐protein‐immunoreactive GABAergic neurons in prefrontal cortex in schizophrenia and bipolar disorder , 2008, Neuropathology : official journal of the Japanese Society of Neuropathology.

[87]  Christian Gieger,et al.  Genome-wide association analyses for lung function and chronic obstructive pulmonary disease identify new loci and potential druggable targets , 2017, Nature Genetics.

[88]  C. Lloyd,et al.  Functions of T cells in asthma: more than just TH2 cells , 2010, Nature Reviews Immunology.

[89]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[90]  J. Hirschhorn,et al.  Biological interpretation of genome-wide association studies using predicted gene functions , 2015, Nature Communications.

[91]  P. Koehler,et al.  One Hundred Years of Migraine Research: Major Clinical and Scientific Observations From 1910 to 2010 , 2011, Headache.

[92]  C. Sudlow,et al.  Genetic risk factors for ischaemic stroke and its subtypes (the METASTROKE Collaboration): a meta-analysis of genome-wide association studies , 2012, The Lancet Neurology.

[93]  N. Eriksson,et al.  Nature Genetics Advance Online Publication Meta-analysis of 375,000 Individuals Identifies 38 Susceptibility Loci for Migraine , 2022 .

[94]  G. Getz,et al.  Lymphotoxin ß Receptor–Dependent Control of Lipid Homeostasis , 2007, Science.

[95]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[96]  Giulio Genovese,et al.  Schizophrenia risk from complex variation of complement component 4 , 2016, Nature.

[97]  Yakir A Reshef,et al.  Partitioning heritability by functional annotation using genome-wide association summary statistics , 2015, Nature Genetics.

[98]  G. Hall,et al.  Cortical thickness in bipolar disorder: a systematic review , 2016, Bipolar disorders.

[99]  Jonathan P. Beauchamp,et al.  Genetic variants associated with subjective well-being, depressive symptoms and neuroticism identified through genome-wide analyses , 2016, Nature Genetics.

[100]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[101]  P. D. de Bakker,et al.  Genome‐wide meta‐analysis identifies novel multiple sclerosis susceptibility loci , 2011, Annals of neurology.

[102]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[103]  E. Eskin,et al.  Integrating Functional Data to Prioritize Causal Variants in Statistical Fine-Mapping Studies , 2014, PLoS genetics.

[104]  Sarah A. Gagliano,et al.  Genomics implicates adaptive and innate immunity in Alzheimer's and Parkinson's diseases , 2016, bioRxiv.

[105]  P. Deloukas,et al.  Multiple common variants for celiac disease influencing immune gene expression , 2010, Nature Genetics.

[106]  K. Hao,et al.  A common haplotype lowers PU.1 expression in myeloid cells and delays onset of Alzheimer's disease , 2017, Nature Neuroscience.

[107]  R. Xavier,et al.  Unravelling the pathogenesis of inflammatory bowel disease , 2007, Nature.