Local genetic correlation gives insights into the shared genetic architecture of complex traits

Although genetic correlations between complex traits provide valuable insights into epidemiological and etiological studies, a precise quantification of which genomic regions contribute to the genome-wide genetic correlation is currently lacking. Here, we introduce ρ-HESS, a technique to quantify the correlation between pairs of traits due to genetic variation at a small region in the genome. Our approach only requires GWAS summary data and makes no distributional assumption on the causal variant effects sizes while accounting for linkage disequilibrium (LD) and overlapping GWAS samples. We analyzed large-scale GWAS summary data across 35 complex traits, and identified 27 genomic regions that contribute significantly to the genetic correlation among these traits. Notably, we find 7 genomic regions that contribute to the genetic correlation of 12 pairs of traits that show negligible genome-wide correlation, further showcasing the power of local genetic correlation analyses. Finally, we leverage the distribution of local genetic correlations across the genome to assign putative direction of causality for 15 pairs of traits.

[1]  Judy H. Cho,et al.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations , 2015, Nature Genetics.

[2]  G. Davey Smith,et al.  Mendelian randomization: genetic anchors for causal inference in epidemiological studies , 2014, Human molecular genetics.

[3]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[4]  F. Dohan More on celiac disease as a model for schizophrenia. , 1983, Biological psychiatry.

[5]  Tanya M. Teslovich,et al.  Discovery and refinement of loci associated with lipid levels , 2013, Nature Genetics.

[6]  Joseph K. Pickrell,et al.  Detection and interpretation of shared genetic influences on 42 human traits , 2015, Nature Genetics.

[7]  Christian Gieger,et al.  New gene functions in megakaryopoiesis and platelet formation , 2011, Nature.

[8]  G. Vogler,et al.  Methodology for genetic studies of twins and families , 1993 .

[9]  Christian Gieger,et al.  Seventy-five genetic loci influencing the human red blood cell , 2012, Nature.

[10]  Peter Donnelly,et al.  Progress and promise in understanding the genetic basis of common diseases , 2015, Proceedings of the Royal Society B: Biological Sciences.

[11]  D. Baker,et al.  The education effect on population health: a reassessment. , 2011, Population and development review.

[12]  L. Isserlis ON A FORMULA FOR THE PRODUCT-MOMENT COEFFICIENT OF ANY ORDER OF A NORMAL FREQUENCY DISTRIBUTION IN ANY NUMBER OF VARIABLES , 1918 .

[13]  Andrew D. Johnson,et al.  Parent-of-origin specific allelic associations among 106 genomic loci for age at menarche , 2014, Nature.

[14]  A. Price,et al.  Dissecting the genetics of complex traits using summary association statistics , 2016, Nature Reviews Genetics.

[15]  M. Silles The causal effect of education on health: Evidence from the United Kingdom , 2009 .

[16]  Christian Gieger,et al.  Edinburgh Research Explorer Common variants at 10 genomic loci influence hemoglobin A(C) levels via glycemic and nonglycemic pathways , 2010 .

[17]  Christian Gieger,et al.  New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk , 2010, Nature Genetics.

[18]  Sang Hong Lee,et al.  Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood , 2012, Bioinform..

[19]  M. Daly,et al.  An Atlas of Genetic Correlations across Human Diseases and Traits , 2015, Nature Genetics.

[20]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[21]  Yakir A Reshef,et al.  Partitioning heritability by functional annotation using genome-wide association summary statistics , 2015, Nature Genetics.

[22]  Jonathan P. Beauchamp,et al.  Genetic variants associated with subjective well-being, depressive symptoms and neuroticism identified through genome-wide analyses , 2016, Nature Genetics.

[23]  Jun S. Liu,et al.  Genetics of rheumatoid arthritis contributes to biology and drug discovery , 2013 .

[24]  W. Rathmann,et al.  Age at Menarche and Its Association with the Metabolic Syndrome and Its Components: Results from the KORA F4 Study , 2011, PloS one.

[25]  Joseph K. Pickrell,et al.  Approximately independent linkage disequilibrium blocks in human populations , 2015, bioRxiv.

[26]  Mark A Pereira,et al.  Earlier age at menarche is associated with higher diabetes risk and cardiometabolic disease risk factors in Brazilian adults: Brazilian Longitudinal Study of Adult Health (ELSA-Brasil) , 2014, Cardiovascular Diabetology.

[27]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[28]  R. Hubbard,et al.  Risk of schizophrenia in people with coeliac disease, ulcerative colitis and Crohn's disease: a general population‐based study , 2006, Alimentary pharmacology & therapeutics.

[29]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[30]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[31]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[32]  Jonathan P. Beauchamp,et al.  Genome-wide association study identifies 74 loci associated with educational attainment , 2016, Nature.

[33]  Tanya M. Teslovich,et al.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes , 2012, Nature Genetics.

[34]  M. Schwarz,et al.  Immune System and Schizophrenia. , 2010, Current immunology reviews.

[35]  G. Carey,et al.  Inference about genetic correlations , 1988, Behavior genetics.

[36]  R. Elston,et al.  The investigation of linkage between a quantitative trait and a marker locus , 1972, Behavior genetics.

[37]  Beth Wilmot,et al.  Edinburgh Explorer Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture , 2022 .

[38]  Dohan Fc More on celiac disease as a model for schizophrenia. , 1983 .

[39]  B. Pasaniuc,et al.  Contrasting the genetic architecture of 30 complex traits from summary association data , 2016, bioRxiv.

[40]  Tomaz Berisa,et al.  Detection and interpretation of shared genetic influences on 40 human traits , 2015 .

[41]  Jonathan P. Beauchamp,et al.  Genetic Associations with Subjective Well-Being Also Implicate Depression and Neuroticism , 2015, bioRxiv.

[42]  Christian Gieger,et al.  A genome-wide meta-analysis identifies 22 loci associated with eight hematological parameters in the HaemGen consortium , 2009, Nature Genetics.

[43]  J. Mefford,et al.  The Covariate's Dilemma , 2012, PLoS genetics.

[44]  Peter Kraft,et al.  Adjusting for heritable covariates can bias effect estimates in genome-wide association studies. , 2015, American journal of human genetics.

[45]  J. P. Hegmann,et al.  Estimating genetic correlations from inbred strains , 1981, Behavior genetics.

[46]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[47]  Bogdan Pasaniuc,et al.  Contrasting the genetic architecture of 30 complex traits from summary association data , 2016, bioRxiv.

[48]  G. Siest,et al.  Genetic influences on lipid metabolism trait variability within the Stanislas Cohort. , 2001, Journal of lipid research.

[49]  John Spertus,et al.  Plasma HDL cholesterol and risk of myocardial infarction: a mendelian randomisation study , 2012, The Lancet.

[50]  Bradley Efron,et al.  Bayesian inference and the parametric bootstrap. , 2012, The annals of applied statistics.

[51]  Tom R. Gaunt,et al.  LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis , 2016, bioRxiv.

[52]  A. Gusev,et al.  Integrating gene expression with summary association statistics to identify susceptibility genes for 30 complex traits , 2016, bioRxiv.

[53]  Tamara S. Roman,et al.  New genetic loci link adipose and insulin biology to body fat distribution , 2014, Nature.

[54]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[55]  George Davey Smith,et al.  Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology , 2008, Statistics in medicine.

[56]  Sarah Lewis,et al.  Genetic epidemiology and public health: hope, hype, and future prospects , 2005, The Lancet.

[57]  R. Macdermott,et al.  Alterations of the immune system in ulcerative colitis and Crohn's disease. , 1988, Advances in immunology.

[58]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[59]  M. Tobin,et al.  Mendelian Randomisation and Causal Inference in Observational Epidemiology , 2008, PLoS medicine.