Improving genetic prediction by leveraging genetic correlations among human diseases and traits

Genomic prediction has the potential to contribute to precision medicine. However, to date, the utility of such predictors is limited due to low accuracy for most traits. Here theory and simulation study are used to demonstrate that widespread pleiotropy among phenotypes can be utilised to improve genomic risk prediction. We show how a genetic predictor can be created as a weighted index that combines published genome-wide association study (GWAS) summary statistics across many different traits. We apply this framework to predict risk of schizophrenia and bipolar disorder in the Psychiatric Genomics consortium data, finding substantial heterogeneity in prediction accuracy increases across cohorts. For six additional phenotypes in the UK Biobank data, we find increases in prediction accuracy ranging from 0.7% for height to 47% for type 2 diabetes, when using a multi-trait predictor that combines published summary statistics from multiple traits, as compared to a predictor based only on one trait.Genetic prediction of complex traits so far has limited accuracy because of insufficient understanding of the genetic risk. Here, Maier et al. develop an improved method for trait prediction that makes use of genetic correlations between traits and apply it to summary statistics of psychiatric diseases.

[1]  R. L. Quaas,et al.  Multiple Trait Evaluation Using Relatives' Records , 1976 .

[2]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[3]  M. Inouye,et al.  Genomic risk prediction of complex human disease and its clinical application. , 2015, Current opinion in genetics & development.

[4]  Naomi R. Wray,et al.  Estimating Effects and Making Predictions from Genome-Wide Marker Data , 2010, 1010.4710.

[5]  Jianxin Shi,et al.  Developing and evaluating polygenic risk prediction models for stratified disease prevention , 2016, Nature Reviews Genetics.

[6]  M. Goddard Genomic selection: prediction of accuracy and maximisation of long term response , 2009, Genetica.

[7]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[8]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[9]  P. Visscher,et al.  The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling , 2010, PLoS genetics.

[10]  Avner Schlessinger,et al.  ICAHN SCHOOL OF MEDICINE AT MOUNT SINAI , 2012 .

[11]  Hans D. Daetwyler,et al.  Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach , 2008, PloS one.

[12]  J. Lush,et al.  THE EFFICIENCY OF THREE METHODS OF SELECTION , 1942 .

[13]  Jianxin Shi,et al.  Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs , 2013, Nature Genetics.

[14]  Disorder Working Group Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4 , 2012, Nature Genetics.

[15]  Mark I McCarthy,et al.  Genomic inflation factors under polygenic inheritance , 2011, European Journal of Human Genetics.

[16]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[17]  P. Visscher,et al.  Simultaneous Discovery, Estimation and Prediction Analysis of Complex Traits Using a Bayesian Mixture Model , 2015, PLoS genetics.

[18]  H. F. Smith,et al.  A DISCRIMINANT FUNCTION FOR PLANT SELECTION , 1936 .

[19]  Simon C. Potter,et al.  Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia , 2013, Nature Genetics.

[20]  Peter M Visscher,et al.  Prediction of individual genetic risk to disease from genome-wide association studies. , 2007, Genome research.

[21]  Tanya M. Teslovich,et al.  Genetic evidence of assortative mating in humans , 2017, Nature Human Behaviour.

[22]  C. R. Henderson,et al.  Best linear unbiased estimation and prediction under a selection model. , 1975, Biometrics.

[23]  L. Schaeffer Sire and Cow Evaluation Under Multiple Trait Models , 1984 .

[24]  Nicholas Katsanis,et al.  Molecular genetic testing and the future of clinical genomics , 2013, Nature Reviews Genetics.

[25]  Lude Franke,et al.  Erratum: Common genetic variants associated with cognitive performance identified using the proxy-phenotype method (Proc Natl Acad Sci USA (2014) 111 (13790-13794) DOI: 10.1073/pnas.1404623111) , 2015 .

[26]  P. Visscher,et al.  Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores , 2015, bioRxiv.

[27]  K. Lange,et al.  Prioritizing GWAS results: A review of statistical methods and recommendations for their application. , 2010, American journal of human genetics.

[28]  F. Dudbridge Power and Predictive Accuracy of Polygenic Risk Scores , 2013, PLoS genetics.

[29]  A. Price,et al.  Dissecting the genetics of complex traits using summary association statistics , 2016, Nature Reviews Genetics.

[30]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.

[31]  N. Wray,et al.  Research review: Polygenic methods and their application to psychiatric traits. , 2014, Journal of child psychology and psychiatry, and allied disciplines.

[32]  P. Visscher,et al.  Multi-trait analysis of genome-wide association summary statistics using MTAG , 2017, Nature Genetics.

[33]  P. Visscher,et al.  Pitfalls of predicting complex traits from SNPs , 2013, Nature Reviews Genetics.

[34]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[35]  C. R. Henderson SIRE EVALUATION AND GENETIC TRENDS , 1973 .

[36]  L. N. Hazel The Genetic Basis for Constructing Selection Indexes. , 1943, Genetics.

[37]  Can Yang,et al.  Improving genetic risk prediction by leveraging pleiotropy , 2013, Human Genetics.

[38]  P. Visscher,et al.  Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits , 2012, Nature Genetics.

[39]  Karin Meyer,et al.  A review of theoretical aspects in the estimation of breeding values for multi-trait selection , 1986 .

[40]  Daniel Gianola,et al.  Predicting genetic predisposition in humans: the promise of whole-genome markers , 2010, Nature Reviews Genetics.

[41]  C. Sabatti,et al.  Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.

[42]  Naomi R. Wray,et al.  Using information of relatives in genomic prediction to apply effective stratified medicine , 2017, Scientific Reports.

[43]  M. Calus,et al.  An Equation to Predict the Accuracy of Genomic Values by Combining Data from Multiple Traits, Populations, or Environments , 2015, Genetics.

[44]  M. Daly,et al.  An Atlas of Genetic Correlations across Human Diseases and Traits , 2015, Nature Genetics.

[45]  P. Visscher,et al.  Advantages and pitfalls in the application of mixed-model association methods , 2014, Nature Genetics.

[46]  Laura J. Scott,et al.  Joint Analysis of Psychiatric Disorders Increases Accuracy of Risk Prediction for Schizophrenia, Bipolar Disorder, and Major Depressive Disorder , 2015, American journal of human genetics.