Evolution of Breast Cancer Recurrence Risk Prediction: A Systematic Review of Statistical and Machine Learning-Based Models.

PURPOSE Selection of appropriate adjuvant therapy to ultimately reduce the risk of breast cancer (BC) recurrence is a challenge for medical oncologists. Several automated risk prediction models have been developed using retrospective clinical data and have evolved significantly over the years in terms of predictors of recurrence, data usage, and predictive techniques (statistical/machine learning [ML]). METHODS Following PRISMA guidelines, we performed a systematic literature review of the aforementioned statistical and ML models published between January 2008 and December 2022 through searching five digital databases-PubMed, ScienceDirect, Scopus, Cochrane, and Web of Science. The comprehensive search yielded a total of 163 papers and after a screening process focusing on papers that dealt exclusively with statistical/ML methods, only 23 papers were deemed appropriate for further analysis. We benchmarked the studies on the basis of development, evaluation metrics, and validation strategy with an added emphasis on racial diversity of patients included in the studies. RESULTS In total, 30.4% of the included studies use statistical techniques, while 69.6% are ML-based. Among these, traditional ML models (support vector machines, decision tree, logistic regression, and naïve Bayes) are the most frequently used (26.1%) along with deep learning (26.1%). Deep learning and ensemble learning provide the most accurate predictions (AUC = 0.94 each). CONCLUSION ML-based prediction models exhibit outstanding performance, yet their practical applicability might be hindered by limited interpretability and reduced generalization. Moreover, predictive models for BC recurrence often focus on limited variables related to tumor, treatment, molecular, and clinical features. Imbalanced classes and the lack of open-source data sets impede model development and validation. Furthermore, existing models predominantly overlook African and Middle Eastern populations, as they are trained and validated mainly on Caucasian and Asian patients.

[1]  P. Rajpurkar,et al.  Multimodal biomedical AI , 2022, Nature Medicine.

[2]  W. Woodward,et al.  A multi‐institutional prediction model to estimate the risk of recurrence and mortality after mastectomy for T1‐2N1 breast cancer , 2022, Cancer.

[3]  Q. Sun,et al.  Comparison of CTS5 risk model and 21-gene recurrence score assay in large-scale breast cancer population and combination of CTS5 and recurrence score to develop a novel nomogram for prognosis prediction , 2022, Breast.

[4]  T. Ishikawa,et al.  A prediction model for early systemic recurrence in breast cancer using a molecular diagnostic analysis of sentinel lymph nodes: A large‐scale, multicenter cohort study , 2022, Cancer.

[5]  Geng Tian,et al.  Prediction of HER2-positive breast cancer recurrence and metastasis risk from histopathological images and clinical information via multimodal deep learning , 2021, Computational and structural biotechnology journal.

[6]  A. Souadka,et al.  Multidisciplinary team meeting as a highly recommended EUSOMA criteria evaluating the quality of breast cancer management between centers , 2021, Breast.

[7]  E. Chuang,et al.  Prediction of Breast Cancer Recurrence Using a Deep Convolutional Neural Network Without Region-of-Interest Labeling , 2021, Frontiers in Oncology.

[8]  Yifen Zhang,et al.  Prediction of BRCA Gene Mutation in Breast Cancer Based on Deep Learning and Histopathology Images , 2021, Frontiers in Genetics.

[9]  J. Lee,et al.  Deep Learning-Based Prediction Model for Breast Cancer Recurrence Using Adjuvant Breast Cancer Cohort in Tertiary Cancer Center Registry , 2021, Frontiers in Oncology.

[10]  D. Rubin,et al.  Weakly supervised temporal model for prediction of breast cancer distant recurrence , 2021, Scientific Reports.

[11]  E. Mayo-Wilson,et al.  The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. , 2021, Journal of clinical epidemiology.

[12]  Yuan Luo,et al.  Prediction of breast cancer distant recurrence using natural language processing and knowledge-guided convolutional neural network , 2020, Artif. Intell. Medicine.

[13]  Huimin Zhao,et al.  A case-based ensemble learning system for explainable breast cancer recurrence prediction , 2020, Artif. Intell. Medicine.

[14]  N. Zerhouni,et al.  Prediction of Oncotype DX recurrence score using deep multi-layer perceptrons in estrogen receptor-positive, HER2-negative breast cancer , 2020, Breast Cancer.

[15]  W. Han,et al.  Development of a Nomogram to Predict the Recurrence Score of 21-Gene Prediction Assay in Hormone Receptor-Positive Early Breast Cancer. , 2020, Clinical breast cancer.

[16]  J. Wu,et al.  Breast cancer recurrence risk prediction using whole-lesion histogram analysis with diffusion kurtosis imaging. , 2019, Clinical radiology.

[17]  X. Shu,et al.  Overall Mortality After Diagnosis of Breast Cancer in Men vs Women. , 2019, JAMA oncology.

[18]  S. Coughlin Social determinants of breast cancer risk, stage, and survival , 2019, Breast Cancer Research and Treatment.

[19]  Tianfu Wang,et al.  Breast Cancer Detection and Diagnosis Using Mammographic Data: Systematic Review , 2019, Journal of medical Internet research.

[20]  Roie Melamed,et al.  Predicting Breast Cancer by Applying Deep Learning to Linked Health Records and Mammograms. , 2019, Radiology.

[21]  G. Pazour,et al.  Ror2 signaling regulates Golgi structure and transport through IFT20 for tumor invasiveness , 2017, Scientific Reports.

[22]  Jack Cuzick,et al.  Use of the concordance index for predictors of censored survival data , 2016, Statistical methods in medical research.

[23]  M. Mansourian,et al.  A Hybrid Computer-aided-diagnosis System for Prediction of Breast Cancer Recurrence (HPBCR) Using Optimized Ensemble Learning , 2016, Computational and structural biotechnology journal.

[24]  Miriam Seoane Santos,et al.  Predicting Breast Cancer Recurrence Using Machine Learning Techniques , 2016, ACM Comput. Surv..

[25]  F. Baehner The analytical validation of the Oncotype DX Recurrence Score assay , 2016, Ecancermedicalscience.

[26]  Rae Woong Park,et al.  Nomogram of Naive Bayesian Model for Recurrence Prediction of Breast Cancer , 2016, Healthcare informatics research.

[27]  Josh J. Carlson,et al.  The impact of the Oncotype Dx breast cancer assay in clinical practice: a systematic review and meta-analysis , 2013, Breast Cancer Research and Treatment.

[28]  R. Gelber,et al.  A risk score based on histopathological features predicts higher risk of distant recurrence in premenopausal patients with lymph node-negative endocrine-responsive breast cancer. , 2012, Breast.

[29]  Rae Woong Park,et al.  Development of Novel Breast Cancer Recurrence Prediction Model Using Support Vector Machine , 2012, Journal of breast cancer.

[30]  D. Coradini,et al.  A Prediction Model for Breast Cancer Recurrence after Adjuvant Hormone Therapy , 2008 .