Discriminatory power of common genetic variants in personalized breast cancer diagnosis

Technology advances in genome-wide association studies (GWAS) has engendered optimism that we have entered a new age of precision medicine, in which the risk of breast cancer can be predicted on the basis of a person’s genetic variants. The goal of this study is to evaluate the discriminatory power of common genetic variants in breast cancer risk estimation. We conducted a retrospective case-control study drawing from an existing personalized medicine data repository. We collected variables that predict breast cancer risk: 153 high-frequency/low-penetrance genetic variants, reflecting the state-of-the-art GWAS on breast cancer, mammography descriptors and BI-RADS assessment categories in the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We trained and tested naïve Bayes models by using these predictive variables. We generated ROC curves and used the area under the ROC curve (AUC) to quantify predictive performance. We found that genetic variants achieved comparable predictive performance to BI-RADS assessment categories in terms of AUC (0.650 vs. 0.659, p-value = 0.742), but significantly lower predictive performance than the combination of BI-RADS assessment categories and mammography descriptors (0.650 vs. 0.751, p-value < 0.001). A better understanding of relative predictive capability of genetic variants and mammography data may benefit clinicians and patients to make appropriate decisions about breast cancer screening, prevention, and treatment in the era of precision medicine.

[1]  Thomas Brüning,et al.  CYP2B6*6 is associated with increased breast cancer risk , 2014, International journal of cancer.

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  Wei Lu,et al.  Functional variants at the 11q13 risk locus for breast cancer regulate cyclin D1 expression through long-range enhancers. , 2013, American journal of human genetics.

[4]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[5]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[6]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[7]  Julian Peto,et al.  Genetic Predisposition to In Situ and Invasive Lobular Carcinoma of the Breast , 2014, PLoS genetics.

[8]  C. McCarty,et al.  Marshfield Clinic Personalized Medicine Research Project (PMRP): design, methods and recruitment for a large population-based biobank. , 2005, Personalized medicine.

[9]  David Page,et al.  Genetic Variants Improve Breast Cancer Risk Prediction on Mammograms , 2013, AMIA.

[10]  Patrick Neven,et al.  Identification of New Genetic Susceptibility Loci for Breast Cancer Through Consideration of Gene‐Environment Interactions , 2014, Genetic epidemiology.

[11]  C. D. Page,et al.  Comparing Mammography Abnormality Features to Genetic Variants in the Prediction of Breast Cancer in Women Recommended for Breast Biopsy. , 2016, Academic radiology.

[12]  W. Willett,et al.  A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer , 2007, Nature Genetics.

[13]  Mads Thomassen,et al.  Identification of a BRCA2-Specific Modifier Locus at 6p24 Related to Breast Cancer Risk , 2013, PLoS genetics.

[14]  M. Thun,et al.  Performance of Common Genetic Variants in Breast-cancer Risk Models , 2022 .

[15]  Yirong Wu,et al.  Using Multidimensional Mutual Information to Prioritize Mammographic Features for Breast Cancer Diagnosis , 2013, AMIA.

[16]  Orli G. Bahcall,et al.  iCOGS collection provides a collaborative model , 2013, Nature Genetics.

[17]  Jaana M. Hartikainen,et al.  Large-scale genotyping identifies 41 new loci associated with breast cancer risk , 2013, Nature Genetics.

[18]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[19]  Jianxin Shi,et al.  A Genome-wide Association Study of Early-Onset Breast Cancer Identifies PFKM as a Novel Breast Cancer Gene and Supports a Common Genetic Spectrum for Breast Cancer at Any Age , 2014, Cancer Epidemiology, Biomarkers & Prevention.

[20]  Peter Devilee,et al.  A tiny step closer to personalized risk prediction for breast cancer. , 2010, The New England journal of medicine.

[21]  E. Burnside,et al.  New Genetic Variants Improve Personalized Breast Cancer Diagnosis , 2014, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[22]  David Page,et al.  Information Extraction for Clinical Data Mining: A Mammography Case Study , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[23]  Peter A. Fasching,et al.  The UGT1A6_19_GG genotype is a breast cancer risk factor , 2013, Front. Genet..

[24]  Yirong Wu,et al.  A Comprehensive Methodology for Determining the Most Informative Mammographic Features , 2013, Journal of Digital Imaging.

[25]  Jane E. Carpenter,et al.  Genetic variation in mitotic regulatory pathway genes is associated with breast tumor grade. , 2014, Human molecular genetics.

[26]  Thomas Brüning,et al.  Investigation of gene‐environment interactions between 47 newly identified breast cancer susceptibility loci and environmental risk factors , 2015, International journal of cancer.

[27]  Katherine L Nathanson,et al.  Common breast cancer risk variants in the post-COGS era: a comprehensive review , 2013, Breast Cancer Research.

[28]  Karen L. Mohlke,et al.  Genetic Risk Prediction — Are We There Yet? , 2009 .

[29]  M. Gail Value of adding single-nucleotide polymorphism genotypes to a breast cancer risk model. , 2009, Journal of the National Cancer Institute.

[30]  M. Gail Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk. , 2008, Journal of the National Cancer Institute.

[31]  Nilanjan Chatterjee,et al.  Estimation of effect size distribution from genome-wide association studies and implications for future discoveries , 2010, Nature Genetics.

[32]  Kenneth Offit,et al.  Two Decades After BRCA: Setting Paradigms in Personalized Cancer Care and Prevention , 2014, Science.

[33]  Wei Lu,et al.  Fine-scale mapping of the FGFR2 breast cancer risk locus: putative functional variants differentially bind FOXA1 and E2F1. , 2013, American journal of human genetics.

[34]  D. Vanel The American College of Radiology (ACR) Breast Imaging and Reporting Data System (BI-RADS): a step towards a universal radiological language? , 2007, European journal of radiology.

[35]  Patrick Neven,et al.  Genetic variation at CYP3A is associated with age at menarche and breast cancer risk: a case-control study , 2014, Breast Cancer Research.

[36]  Matthias W. Beckmann,et al.  2q36.3 is associated with prognosis for oestrogen receptor-negative breast cancer patients treated with chemotherapy , 2014, Nature Communications.

[37]  David Page,et al.  Comparing the Value of Mammographic Features and Genetic Variants in Breast Cancer Risk Prediction , 2014, AMIA.

[38]  C. Metz,et al.  Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data. , 1998, Statistics in medicine.