From Genotype to Phenotype: Polygenic Prediction of Complex Human Traits.

Decoding the genome confers the capability to predict characteristics of the organism (phenotype) from DNA (genotype). We describe the present status and future prospects of genomic prediction of complex traits in humans. Some highly heritable complex phenotypes, such as height and other quantitative traits, can already be predicted with reasonable accuracy from DNA alone. For many diseases, including important common conditions such as coronary artery disease, breast cancer, and type 1 and type 2 diabetes, individuals with outlier polygenic scores (e.g., in the top few percent) have been shown to have 5 or even 10 times the average risk. Several psychiatric conditions, such as schizophrenia and autism, also fall into this category. We discuss related topics such as the genetic architecture of complex traits, sibling validation of polygenic scores, and applications to adult health, in vitro fertilization (embryo selection), and genetic engineering.
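
As a rough illustration of what a polygenic score and an "outlier" score mean, the sketch below assumes the common additive form, in which a score is the sum over variants of an effect size times an allele count (0, 1, or 2). The abstract does not specify the scoring model, so the function names, effect sizes, and genotypes here are hypothetical toy values, not results from the paper.

```python
from typing import List


def polygenic_score(allele_counts: List[int], effect_sizes: List[float]) -> float:
    """Additive score: sum of (effect size * allele count) over variants.

    Assumption: a simple linear model; the effect sizes would normally be
    estimated from a large training cohort.
    """
    if len(allele_counts) != len(effect_sizes):
        raise ValueError("one effect size is required per variant")
    return sum(b * x for b, x in zip(effect_sizes, allele_counts))


def top_fraction(scores: List[float], fraction: float) -> List[int]:
    """Indices of individuals whose scores fall in the top `fraction` of the cohort."""
    n_top = max(1, int(len(scores) * fraction))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:n_top])


# Hypothetical toy data: 3 variants, 5 individuals (rows are individuals).
effect_sizes = [0.5, -0.25, 1.0]
genotypes = [
    [0, 1, 2],
    [2, 0, 0],
    [1, 1, 1],
    [2, 2, 2],
    [0, 0, 0],
]

scores = [polygenic_score(g, effect_sizes) for g in genotypes]
outliers = top_fraction(scores, 0.2)  # top 20% of this tiny cohort
print(scores, outliers)
```

In a real cohort the cutoff would be a few percent of many thousands of individuals, and the elevated risk of that group would be measured against observed disease outcomes rather than read off the score itself.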
