Machine Learning-Based Analysis of MR Multiparametric Radiomics for the Subtype Classification of Breast Cancer

Objective: To investigate whether machine learning analysis of multiparametric MR radiomics can help classify immunohistochemical (IHC) subtypes of breast cancer. Study design: One hundred and thirty-four consecutive patients with pathologically-proven invasive ductal carcinoma were retrospectively analyzed. A total of 2,498 features were extracted from the DCE and DWI images, together with the new calculated images, including DCE images changing over six time points (DCEsequential) and DWI images changing over three b-values (DWIsequential). We proposed a novel two-stage feature selection method combining traditional statistics and machine learning-based methods. The accuracies of the 4-IHC classification and triple negative (TN) vs. non-TN cancers were assessed. Results: For the 4-IHC classification task, the best accuracy of 72.4% was achieved based on linear discriminant analysis (LDA) or subspace discrimination of assembled learning in conjunction with 20 selected features, and only small dependent emphasis of Kendall-tau-b for sequential features, based on the DWIsequential with the LDA model, yielding an accuracy of 53.7%. The linear support vector machine (SVM) and medium k-nearest neighbor using eight features yielded the highest accuracy of 91.0% for comparing TN to non-TN cancers, and the maximum variance for DWIsequential alone, together with a linear SVM model, achieved an accuracy of 83.6%. Conclusions: Whole-tumor radiomics on MR multiparametric images, DCE images changing over time points, and DWI images changing over different b-values provide a non-invasive analytical approach for breast cancer subtype classification and TN cancer identification.

[1]  W. Moon,et al.  Correlation of perfusion parameters on dynamic contrast‐enhanced MRI with prognostic factors and subtypes of breast cancers , 2012, Journal of magnetic resonance imaging : JMRI.

[2]  J. Ogutu,et al.  Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions , 2012, BMC Proceedings.

[3]  Harini Veeraraghavan,et al.  Breast cancer molecular subtype classifier that incorporates MRI features , 2016, Journal of magnetic resonance imaging : JMRI.

[4]  D. Dabbs,et al.  Immunohistochemical surrogate markers of breast cancer molecular classes predicts response to neoadjuvant chemotherapy , 2010, Cancer.

[5]  R. Gelber,et al.  Strategies for subtypes—dealing with the diversity of breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2011 , 2011, Annals of oncology : official journal of the European Society for Medical Oncology.

[6]  S. Mugikura,et al.  Luminal-type breast cancer: correlation of apparent diffusion coefficients with the Ki-67 labeling index. , 2015, Radiology.

[7]  O Nalcioglu,et al.  Triple-negative breast cancer: MRI features in 29 patients. , 2007, Annals of oncology : official journal of the European Society for Medical Oncology.

[8]  Chia-Feng Lu,et al.  Machine Learning–Based Radiomics for Molecular Subtyping of Gliomas , 2018, Clinical Cancer Research.

[9]  J. Affeldt,et al.  The feasibility study , 2019, The Information System Consultant’s Handbook.

[10]  Kenneth G. A. Gilhuijs,et al.  Association between rim enhancement of breast cancer on dynamic contrast-enhanced MRI and patient outcome: impact of subtype , 2014, Breast Cancer Research and Treatment.

[11]  R. Ponzone,et al.  Correlations between diffusion-weighted imaging and breast cancer biomarkers , 2012, European Radiology.

[12]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[13]  Maciej A Mazurowski,et al.  Radiogenomic analysis of breast cancer: luminal B molecular subtype is associated with enhancement dynamics at MR imaging. , 2014, Radiology.

[14]  R. A. Lerski,et al.  Magnetic resonance imaging texture analysis classification of primary breast cancer , 2016, European Radiology.

[15]  T. Uematsu,et al.  Triple-negative breast cancer: correlation between MR imaging and pathologic findings. , 2009, Radiology.

[16]  Nello Cristianini,et al.  Support Vector Machines and Kernel Methods: The New Generation of Learning Machines , 2002, AI Mag..

[17]  Lars J. Grimm,et al.  Computational approach to radiogenomics of breast cancer: Luminal A and luminal B molecular subtypes are associated with imaging features on routine breast MRI extracted using computer vision algorithms , 2015, Journal of magnetic resonance imaging : JMRI.

[18]  Qing Chang,et al.  Feature selection methods for big data bioinformatics: A survey from the search perspective. , 2016, Methods.

[19]  Paul M. Thompson,et al.  What is where and why it is important , 2007, NeuroImage.

[20]  Robert C. Wolpert,et al.  A Review of the , 1985 .

[21]  S. Rodenhuis,et al.  Magnetic resonance imaging response monitoring of breast cancer during neoadjuvant chemotherapy: relevance of breast cancer subtype. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[22]  C. Marsault,et al.  Diffusion-weighted MR imaging of the breast: advantages and pitfalls. , 2013, European journal of radiology.

[23]  Kunwei Shen,et al.  Breast Cancer: Diffusion Kurtosis MR Imaging-Diagnostic Accuracy and Correlation with Clinical-Pathologic Factors. , 2015, Radiology.

[24]  A. Madabhushi,et al.  Computerized image analysis for identifying triple-negative breast cancers and differentiating them from other molecular subtypes of breast cancer on dynamic contrast-enhanced MR images: a feasibility study. , 2014, Radiology.

[25]  D. Sodickson,et al.  Evaluation of breast cancer using intravoxel incoherent motion (IVIM) histogram analysis: comparison with malignant status, histological subtype, and molecular prognostic factors , 2016, European Radiology.

[26]  M. Mazurowski Radiogenomics: what it is and why it is important. , 2015, Journal of the American College of Radiology : JACR.

[27]  R. Samworth Optimal weighted nearest neighbour classifiers , 2011, 1101.5783.

[28]  Paolo Morandi,et al.  Pathological complete response rates following different neoadjuvant chemotherapy regimens for operable breast cancer according to ER status, in two parallel, randomized phase II trials with an adaptive study design (ECTO II) , 2012, Breast Cancer Research and Treatment.

[29]  Andriy Fedorov,et al.  Computational Radiomics System to Decode the Radiographic Phenotype. , 2017, Cancer research.

[30]  Eric M Blaschke,et al.  MRI phenotype of breast cancer: Kinetic assessment for molecular subtypes , 2015, Journal of magnetic resonance imaging : JMRI.

[31]  L. Qiu,et al.  A preliminary study , 2018, Medicine.

[32]  C. Kuhl,et al.  Mammographic, US, and MR imaging phenotypes of familial breast cancer. , 2008, Radiology.

[33]  Lihua Li,et al.  Diffusion‐weighted imaging features of breast tumours and the surrounding stroma reflect intrinsic heterogeneous characteristics of molecular subtypes in breast cancer , 2018, NMR in biomedicine.

[34]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  B. Kang,et al.  Rim sign and histogram analysis of apparent diffusion coefficient values on diffusion-weighted MRI in triple-negative breast cancer: Comparison with ER-positive subtype , 2017, PloS one.

[36]  T. Helbich,et al.  Diffusion-weighted MR for differentiation of breast lesions at 3.0 T: how does selection of diffusion protocols affect diagnosis? , 2009, Radiology.

[37]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[38]  Ruey-Feng Chang,et al.  Quantification of breast tumor heterogeneity for ER status, HER2 status, and TN molecular subtype evaluation on DCE-MRI. , 2016, Magnetic resonance imaging.

[39]  Dong-Sheng Cao,et al.  Recipe for uncovering predictive genes using support vector machines based on model population analysis , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[40]  Lars J. Grimm,et al.  Breast MRI radiogenomics: Current status and research implications , 2016, Journal of magnetic resonance imaging : JMRI.

[41]  Neil P. Jerome,et al.  Support vector machine for breast cancer classification using diffusion‐weighted MRI histogram features: Preliminary study , 2018, Journal of magnetic resonance imaging : JMRI.

[42]  Verónica Bolón-Canedo,et al.  A review of microarray datasets and applied feature selection methods , 2014, Inf. Sci..

[43]  Eun-Kyung Kim,et al.  Triple-negative invasive breast cancer on dynamic contrast-enhanced and diffusion-weighted MR imaging: comparison with other breast cancer subtypes , 2012, European Radiology.

[44]  Salvatore Piscuoglio,et al.  Breast cancer intra-tumor heterogeneity , 2014, Breast Cancer Research.

[45]  Andrzej Materka,et al.  Effects of MRI acquisition parameter variations and protocol heterogeneity on the results of texture analysis and pattern discrimination: an application-oriented study. , 2009, Medical physics.

[46]  Wolfgang Heller,et al.  Triple-negative breast cancer: therapeutic options. , 2007, The Lancet. Oncology.

[47]  T. Helbich,et al.  Multiparametric MRI of the breast: A review , 2018, Journal of magnetic resonance imaging : JMRI.

[48]  J. Xuan,et al.  Classification algorithms for phenotype prediction in genomics and proteomics. , 2008, Frontiers in bioscience : a journal and virtual library.

[49]  Allen L. Soyster,et al.  Technical Note - Convex Programming with Set-Inclusive Constraints and Applications to Inexact Linear Programming , 1973, Oper. Res..