Missing heritability: is the gap closing? An analysis of 32 complex traits in the Lifelines Cohort Study

Despite the recent explosive rise in number of genetic markers for complex disease traits identified in genome-wide association studies, there is still a large gap between the known heritability of these traits and the part explained by these markers. To gauge whether this ‘heritability gap’ is closing, we first identified genome-wide significant SNPs from the literature and performed replication analyses for 32 highly relevant traits from five broad disease areas in 13 436 subjects of the Lifelines Cohort. Next, we calculated the variance explained by multi-SNP genetic risk scores (GRSs) for each trait, and compared it to their broad- and narrow-sense heritabilities captured by all common SNPs. The majority of all previously-associated SNPs (median=75%) were significantly associated with their respective traits. All GRSs were significant, with unweighted GRSs generally explaining less phenotypic variance than weighted GRSs, for which the explained variance was highest for height (15.5%) and varied between 0.02 and 6.7% for the other traits. Broad-sense common-SNP heritability estimates were significant for all traits, with the additive effect of common SNPs explaining 48.9% of the variance for height and between 5.6 and 39.2% for the other traits. Dominance effects were uniformly small (0–1.5%) and not significant. On average, the variance explained by the weighted GRSs accounted for only 10.7% of the common-SNP heritability of the 32 traits. These results indicate that GRSs may not yet be ready for accurate personalized prediction of complex disease traits limiting widespread adoption in clinical practice.

[1]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[2]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[3]  Ronald P. Stolk,et al.  Universal risk factors for multifactorial diseases-LifeLines : a three-generation population-based study , 2008 .

[4]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[5]  B. Browning,et al.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. , 2009, American journal of human genetics.

[6]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[7]  Christian Gieger,et al.  Edinburgh Research Explorer Common variants at 10 genomic loci influence hemoglobin A(C) levels via glycemic and nonglycemic pathways , 2010 .

[8]  Mark N. Wass,et al.  Genetic loci influencing kidney function and chronic kidney disease , 2010, Nature Genetics.

[9]  Christian Gieger,et al.  New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk , 2010, Nature Genetics.

[10]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[11]  Jingyuan Fu,et al.  Common variants in 22 loci are associated with QRS duration and cardiac ventricular conduction , 2010, Nature Genetics.

[12]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[13]  Christian Gieger,et al.  Genome-wide association study of PR interval , 2010, Nature Genetics.

[14]  T. Spector,et al.  Novel genes for QTc interval. How much heritability is explained, and how much is left to find? , 2010, Genome Medicine.

[15]  W. G. Hill,et al.  Genome partitioning of genetic variation for complex traits using common SNPs , 2011, Nature Genetics.

[16]  Sylvia Stracke,et al.  CUBN is a gene locus for albuminuria. , 2011, Journal of the American Society of Nephrology : JASN.

[17]  Christian Gieger,et al.  New gene functions in megakaryopoiesis and platelet formation , 2011, Nature.

[18]  P. Elliott,et al.  Meta-Analysis of Genome-Wide Association Studies in >80 000 Subjects Identifies Multiple Loci for C-Reactive Protein Levels , 2011, Circulation.

[19]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[20]  Christian Gieger,et al.  Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma , 2011, Nature Genetics.

[21]  Tom R. Gaunt,et al.  Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk , 2011, Nature.

[22]  Christian Gieger,et al.  Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk , 2011, Nature.

[23]  Tom R. Gaunt,et al.  Meta-analysis of Dense Genecentric Association Studies Reveals Common and Uncommon Variants Associated with Height. , 2011, American journal of human genetics.

[24]  Christian Gieger,et al.  Multiple Loci Are Associated with White Blood Cell Phenotypes , 2011, PLoS genetics.

[25]  Christian Gieger,et al.  Genome-wide association and large scale follow-up identifies 16 new loci influencing lung function , 2011, Nature Genetics.

[26]  E. Lander,et al.  The mystery of missing heritability: Genetic interactions create phantom heritability , 2012, Proceedings of the National Academy of Sciences.

[27]  Josef Coresh,et al.  Chronic kidney disease , 2012, The Lancet.

[28]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[29]  Christian Gieger,et al.  Seventy-five genetic loci influencing the human red blood cell , 2012, Nature.

[30]  Tanya M. Teslovich,et al.  Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways , 2012, Nature Genetics.

[31]  Shashaank Vattikuti,et al.  Heritability and Genetic Correlations Explained by Common SNPs for Metabolic Syndrome Traits , 2012, PLoS genetics.

[32]  Yiran Guo,et al.  Gene-centric meta-analyses of 108 912 individuals confirm known body mass index loci and reveal three novel signals. , 2013, Human molecular genetics.

[33]  Claude Bouchard,et al.  Identification of heart rate-associated loci and their effects on cardiac conduction and rhythm disorders , 2014 .

[34]  P. Visscher,et al.  Pitfalls of predicting complex traits from SNPs , 2013, Nature Reviews Genetics.

[35]  N. Patterson,et al.  Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits , 2013, PLoS genetics.

[36]  Tom R. Gaunt,et al.  Loci influencing blood pressure identified using a cardiovascular gene-centric array. , 2013, Human molecular genetics.

[37]  Fabian J Theis,et al.  Genome-wide association analyses identify 18 new loci associated with serum urate concentrations , 2012, Nature Genetics.

[38]  Tanya M. Teslovich,et al.  Discovery and refinement of loci associated with lipid levels , 2013, Nature Genetics.

[39]  Tom R. Gaunt,et al.  Gene-centric association signals for haemostasis and thrombosis traits identified with the HumanCVD BeadChip , 2013, Thrombosis and Haemostasis.

[40]  Tom R. Gaunt,et al.  Gene-centric meta-analyses for central adiposity traits in up to 57 412 individuals of European descent confirm known loci and reveal several novel associations. , 2014, Human molecular genetics.

[41]  Christian Gieger,et al.  Gene-centric meta-analysis in 87,736 individuals of European ancestry identifies multiple blood-pressure-related loci. , 2014, American journal of human genetics.

[42]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[43]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[44]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[45]  Michael J Ackerman,et al.  Nature Genetics Advance Online Publication Genetic Association Study of Qt Interval Highlights Role for Calcium Signaling Pathways in Myocardial Repolarization , 2022 .

[46]  Lorna M. Lopez,et al.  Genome-wide association analysis identifies six new loci associated with forced vital capacity , 2014, Nature Genetics.

[47]  P. Visscher,et al.  Nature Genetics Advance Online Publication , 2022 .

[48]  D. Postma,et al.  Low levels of vitamin D are associated with multimorbidity: Results from the LifeLines Cohort Study , 2015, Annals of medicine.

[49]  H. Snieder,et al.  Representativeness of the LifeLines Cohort Study , 2015, PloS one.

[50]  Tamara S. Roman,et al.  New genetic loci link adipose and insulin biology to body fat distribution , 2014, Nature.

[51]  P. Visscher,et al.  Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index , 2015, Nature Genetics.

[52]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[53]  W. G. Hill,et al.  Dominance genetic variation contributes little to the missing heritability for human complex traits. , 2015, American journal of human genetics.

[54]  C. Wijmenga,et al.  Cohort Profile Cohort Profile : LifeLines , a three-generation cohort study and biobank , 2015 .

[55]  R. Marioni,et al.  Improving Phenotypic Prediction by Combining Genetic and Epigenetic Associations , 2015, American journal of human genetics.

[56]  Tom R. Gaunt,et al.  Edinburgh Research Explorer Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function , 2022 .

[57]  Ricardo Pong-Wong,et al.  Evaluating the contribution of genetic and familial shared environment to common disease using the UK Biobank , 2016, Nature Genetics.