Developing a clinical utility framework to evaluate prediction models in radiogenomics

Combining imaging and genetic information to predict disease presence and behavior is being codified into an emerging discipline called “radiogenomics.” Optimal evaluation methodologies for radiogenomics techniques have not been established. We aim to develop a clinical decision framework based on utility analysis to assess prediction models for breast cancer. Our data comes from a retrospective case-control study, collecting Gail model risk factors, genetic variants (single nucleotide polymorphisms-SNPs), and mammographic features in Breast Imaging Reporting and Data System (BI-RADS) lexicon. We first constructed three logistic regression models built on different sets of predictive features: (1) Gail, (2) Gail+SNP, and (3) Gail+SNP+BI-RADS. Then, we generated ROC curves for three models. After we assigned utility values for each category of findings (true negative, false positive, false negative and true positive), we pursued optimal operating points on ROC curves to achieve maximum expected utility (MEU) of breast cancer diagnosis. We used McNemar’s test to compare the predictive performance of the three models. We found that SNPs and BI-RADS features augmented the baseline Gail model in terms of the area under ROC curve (AUC) and MEU. SNPs improved sensitivity of the Gail model (0.276 vs. 0.147) and reduced specificity (0.855 vs. 0.912). When additional mammographic features were added, sensitivity increased to 0.457 and specificity to 0.872. SNPs and mammographic features played a significant role in breast cancer risk estimation (p-value < 0.001). Our decision framework comprising utility analysis and McNemar’s test provides a novel framework to evaluate prediction models in the realm of radiogenomics.

[1]  R. F. Wagner,et al.  Reader Variability in Mammography and Its Implications for Expected Utility over the Population of Readers and Cases , 2004, Medical decision making : an international journal of the Society for Medical Decision Making.

[2]  C. McCarty,et al.  Marshfield Clinic Personalized Medicine Research Project (PMRP): design, methods and recruitment for a large population-based biobank. , 2005, Personalized medicine.

[3]  Vipat Kuruchittham,et al.  The Wisconsin Breast Cancer Epidemiology Simulation Model. , 2006, Journal of the National Cancer Institute. Monographs.

[4]  András Kocsor,et al.  ROC analysis: applications to the classification of biological sequences and 3D structures , 2008, Briefings Bioinform..

[5]  L. Liberman,et al.  The breast imaging reporting and data system: positive predictive value of mammographic features and final assessment categories. , 1998, AJR. American journal of roentgenology.

[6]  D. Vanel The American College of Radiology (ACR) Breast Imaging and Reporting Data System (BI-RADS): a step towards a universal radiological language? , 2007, European journal of radiology.

[7]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[8]  John M Boone,et al.  Estimating the Relative Utility of Screening Mammography , 2013, Medical decision making : an international journal of the Society for Medical Decision Making.

[9]  D. R. Lewis,et al.  Cancer survival and incidence from the Surveillance, Epidemiology, and End Results (SEER) program. , 2003, The oncologist.

[10]  Wendy A. Wolf,et al.  The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies , 2011, BMC Medical Genomics.

[11]  C. D. Page,et al.  Probabilistic computer model developed from clinical data in national mammography database format to classify mammographic findings. , 2009, Radiology.

[12]  Ross D. Shachter,et al.  A Bayesian network for mammography , 2000, AMIA.

[13]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[14]  M. Pencina,et al.  Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond , 2008, Statistics in medicine.

[15]  Konstantin Strauch,et al.  Breast cancer risk assessment across the risk continuum: genetic and nongenetic risk factors contributing to differential model performance , 2012, Breast Cancer Research.

[16]  Craig K. Abbey,et al.  Statistical properties of a utility measure of observer performance compared to area under the ROC curve , 2013, Medical Imaging.

[17]  N. Obuchowski ROC analysis. , 2005, AJR. American journal of roentgenology.

[18]  David Page,et al.  Comparing the Value of Mammographic Features and Genetic Variants in Breast Cancer Risk Prediction , 2014, AMIA.

[19]  Hongbing Shen,et al.  Breast cancer risk assessment with five independent genetic variants and two risk factors in Chinese women , 2012, Breast Cancer Research.

[20]  S. Cummings,et al.  Personalizing Mammography by Breast Density and Other Risk Factors for Breast Cancer: Analysis of Health Benefits and Cost-Effectiveness , 2011, Annals of Internal Medicine.

[21]  M. Kuo,et al.  Behind the numbers: Decoding molecular phenotypes with radiogenomics--guiding principles and technical considerations. , 2014, Radiology.

[22]  Yirong Wu,et al.  A Comprehensive Methodology for Determining the Most Informative Mammographic Features , 2013, Journal of Digital Imaging.

[23]  Natasha K. Stout,et al.  Chapter 7: The Wisconsin Breast Cancer Epidemiology Simulation Model , 2006 .

[24]  John Eng Receiver operating characteristic analysis: utility, reality, covariates, and the future. , 2013, Academic radiology.

[25]  A. Rutman,et al.  Radiogenomics: creating a link between molecular diagnostics and diagnostic imaging. , 2009, European journal of radiology.

[26]  Richard D. Riley,et al.  A systematic review of breast cancer incidence risk prediction models with meta-analysis of their performance , 2012, Breast Cancer Research and Treatment.

[27]  E. Burnside,et al.  New Genetic Variants Improve Personalized Breast Cancer Diagnosis , 2014, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[28]  M. Gail Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk. , 2008, Journal of the National Cancer Institute.

[29]  Margaret S Pepe,et al.  Gauging the performance of SNPs, biomarkers, and clinical factors for predicting risk of breast cancer. , 2008, Journal of the National Cancer Institute.

[30]  M. Thun,et al.  Performance of Common Genetic Variants in Breast-cancer Risk Models , 2022 .

[31]  Oguzhan Alagoz,et al.  Pursuing optimal thresholds to recommend breast biopsy by quantifying the value of tomosynthesis , 2014, Medical Imaging.

[32]  M. Gail Value of adding single-nucleotide polymorphism genotypes to a breast cancer risk model. , 2009, Journal of the National Cancer Institute.

[33]  J. Austoker,et al.  The psychological impact of mammographic screening. A systematic review , 2005, Psycho-oncology.

[34]  Andre Dekker,et al.  Radiogenomics: the search for genetic predictors of radiotherapy response. , 2014, Future oncology.

[35]  David Page,et al.  Genetic Variants Improve Breast Cancer Risk Prediction on Mammograms , 2013, AMIA.

[36]  W. Willett,et al.  A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer , 2007, Nature Genetics.

[37]  Craig K. Abbey,et al.  An Equivalent Relative Utility Metric for Evaluating Screening Mammography , 2010, Medical decision making : an international journal of the Society for Medical Decision Making.

[38]  C. Metz,et al.  Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data. , 1998, Statistics in medicine.

[39]  Mikael Hartman,et al.  Breast cancer risk assessment using genetic variants and risk factors in a Singapore Chinese population , 2014, Breast Cancer Research.

[40]  M. Pencina,et al.  Net reclassification improvement: computation, interpretation, and controversies: a literature review and clinician's guide. , 2014, Annals of Internal Medicine.

[41]  D. D. Maki,et al.  Radiogenomic analysis of breast cancer using MRI: a preliminary study to define the landscape. , 2012, AJR. American journal of roentgenology.

[42]  Ewout W Steyerberg,et al.  Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers , 2011, Statistics in medicine.

[43]  R. Richards-Kortum,et al.  A comparison of C/B ratios from studies using receiver operating characteristic curve analysis. , 1999, Journal of clinical epidemiology.