How accurate can genetic predictions be?

BackgroundPre-symptomatic prediction of disease and drug response based on genetic testing is a critical component of personalized medicine. Previous work has demonstrated that the predictive capacity of genetic testing is constrained by the heritability and prevalence of the tested trait, although these constraints have only been approximated under the assumption of a normally distributed genetic risk distribution.ResultsHere, we mathematically derive the absolute limits that these factors impose on test accuracy in the absence of any distributional assumptions on risk. We present these limits in terms of the best-case receiver-operating characteristic (ROC) curve, consisting of the best-case test sensitivities and specificities, and the AUC (area under the curve) measure of accuracy. We apply our method to genetic prediction of type 2 diabetes and breast cancer, and we additionally show the best possible accuracy that can be obtained from integrated predictors, which can incorporate non-genetic features.ConclusionKnowledge of such limits is valuable in understanding the implications of genetic testing even before additional associations are identified.

[1]  N. Risch,et al.  The genetic epidemiology of cancer: interpreting family and twin studies and their implications for molecular genetic approaches. , 2001, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[2]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[3]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[4]  Hans D. Daetwyler,et al.  Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach , 2008, PloS one.

[5]  E. Steyerberg,et al.  Revisiting the clinical validity of multiplex genetic testing in complex diseases. , 2004, American journal of human genetics.

[6]  F. Collins,et al.  A vision for the future of genomics research , 2003, Nature.

[7]  Peter Kraft,et al.  Genetic risk prediction--are we there yet? , 2009, The New England journal of medicine.

[8]  J. Haines,et al.  Effects of Age, Sex, and Ethnicity on the Association Between Apolipoprotein E Genotype and Alzheimer Disease: A Meta-analysis , 1997 .

[9]  E. Dempster,et al.  Heritability of Threshold Characters. , 1950, Genetics.

[10]  Donald Goldfarb,et al.  A numerically stable dual method for solving strictly convex quadratic programs , 1983, Math. Program..

[11]  T. Valle,et al.  Prevention of type 2 diabetes mellitus by changes in lifestyle among subjects with impaired glucose tolerance. , 2001, The New England journal of medicine.

[12]  J. Haines,et al.  Effects of age, sex, and ethnicity on the association between apolipoprotein E genotype and Alzheimer disease. A meta-analysis. APOE and Alzheimer Disease Meta Analysis Consortium. , 1997, JAMA.

[13]  M. Helfand,et al.  Estimating risk of breast cancer in carriers of BRCA1 and BRCA2 mutations: a meta‐analytic approach , 2007, Statistics in medicine.

[14]  B. Bainbridge,et al.  Genetics , 1981, Experientia.

[15]  S. Humphries,et al.  Utility of genetic and non-genetic risk factors in prediction of type 2 diabetes: Whitehall II prospective cohort study , 2010, BMJ : British Medical Journal.

[16]  Muin J Khoury,et al.  A critical appraisal of the scientific basis of commercial genomic profiles used to assess health risks and personalize health interventions. , 2008, American journal of human genetics.

[17]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[18]  Muin J Khoury,et al.  Evaluation of the discriminative accuracy of genomic profiling in the prediction of common complex diseases , 2010, European Journal of Human Genetics.

[19]  P. Visscher,et al.  The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling , 2010, PLoS genetics.

[20]  D. Goldstein Common genetic variation and human traits. , 2009, The New England journal of medicine.

[21]  Cornelia M van Duijn,et al.  Genome-based prediction of common diseases: advances and prospects. , 2008, Human molecular genetics.

[22]  P. Poulsen,et al.  Heritability of Type II (non-insulin-dependent) diabetes mellitus and abnormal glucose tolerance – a population-based twin study , 1999, Diabetologia.

[23]  J. Kaprio,et al.  Environmental and heritable factors in the causation of cancer--analyses of cohorts of twins from Sweden, Denmark, and Finland. , 2000, The New England journal of medicine.

[24]  A. Robertson,et al.  The Heritability of All-or-None Traits: Viability of Poultry. , 1949, Genetics.

[25]  Tanya M. Teslovich,et al.  No Interactions Between Previously Associated 2-Hour Glucose Gene Variants and Physical Activity or BMI on 2-Hour Glucose Levels , 2012, Diabetes.

[26]  K. Hemminki Correspondence re: Risch, N.: Genetic epidemiology of cancer: interpreting family and twin studies and their implications for molecular genetic approaches. Cancer Epidemiol. Biomark. Prev., 10: 733-741, 2001. , 2002, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[27]  G A Colditz,et al.  Walking compared with vigorous physical activity and risk of type 2 diabetes in women: a prospective study. , 1999, JAMA.

[28]  J. Meyer,et al.  Familial determinants of moderate and intense physical activity: a twin study. , 1997, Medicine and science in sports and exercise.

[29]  Hon-Cheong So,et al.  A Unifying Framework for Evaluating the Predictive Power of Genetic Variants Based on the Level of Heritability Explained , 2010, PLoS genetics.

[30]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[31]  D. Falconer,et al.  Introduction to Quantitative Genetics. , 1962 .

[32]  Ewout W Steyerberg,et al.  Predictive testing for complex diseases using multiple genes: Fact or fiction? , 2006, Genetics in Medicine.

[33]  E. Tolosa,et al.  Phenotype, genotype, and worldwide genetic penetrance of LRRK2-associated Parkinson's disease: a case-control study , 2008, The Lancet Neurology.

[34]  Peter M Visscher,et al.  Prediction of individual genetic risk of complex disease. , 2008, Current opinion in genetics & development.

[35]  Stephen W. Sorensen,et al.  Lifetime risk for diabetes mellitus in the United States. , 2003, JAMA.

[36]  W. G. Hill,et al.  Heritability in the genomics era — concepts and misconceptions , 2008, Nature Reviews Genetics.