Feature Selection from Image Descriptors Data for Breast Cancer Diagnosis Based on CAD

Breast cancer is an important public health problem worldwide among women. Its early detection generally increase the survival rate of patients, however, is one of the biggest deficiencies to the present. The purpose of this paper is to obtain a model capable of classifying benign and malign breast tumors, using a public dataset composed by features extracted from mammography images, obtained from the Breast Cancer Digital Repository initiative. Multivariate and univariate models were constructed using the machine learning algorithm based on CAD, Random Forest, applied to the images features. Both of the models were statistical compared looking for the better model according to their fitness. Results suggest the multivariate model has a better prediction capability than the univariate model, with an AUC between 0.991 and 0.910, however, they were found five specific descriptive features that can classify tumors with a similar fitness as the multivariate model, with AUCs between 0.897 and 0.958.

[1]  Sylvia H. Heywang-Koebrunner,et al.  Diagnostic Breast Imaging , 2000 .

[2]  Patrick Adams,et al.  The breast cancer conundrum. , 2013, Bulletin of the World Health Organization.

[3]  Frank Z. Stanczyk,et al.  Associations of Breast Cancer Risk Factors with Premenopausal Sex Hormones in Women with Very Low Breast Cancer Risk , 2016, International journal of environmental research and public health.

[4]  S. Astley,et al.  Computer-aided detection in mammography. , 2004, Clinical radiology.

[5]  Aboul Ella Hassanien,et al.  Adaptive k-means clustering algorithm for MR breast image segmentation , 2013, Neural Computing and Applications.

[6]  Heng-Da Cheng,et al.  Computer-aided detection and classification of microcalcifications in mammograms: a survey , 2003, Pattern Recognit..

[7]  Miguel Ángel Guevara-López,et al.  An evaluation of image descriptors combined with clinical data for breast cancer diagnosis , 2013, International Journal of Computer Assisted Radiology and Surgery.

[8]  Elaf J. Al Taee,et al.  Breast Cancer Diagnosis by CAD , 2014 .

[9]  L. Costaridou,et al.  Texture analysis of tissue surrounding microcalcifications on mammograms for breast cancer diagnosis. , 2007, The British journal of radiology.

[10]  J. Dheeba,et al.  Computer-aided detection of breast cancer on mammograms: A swarm intelligence optimized wavelet neural network approach , 2014, J. Biomed. Informatics.

[11]  Yilan Liao,et al.  Temporal Trends in Geographical Variation in Breast Cancer Mortality in China, 1973–2005: An Analysis of Nationwide Surveys on Cause of Death , 2016, International journal of environmental research and public health.

[12]  P. Taylor,et al.  A systematic review of computer-assisted diagnosis in diagnostic cancer imaging. , 2012, European journal of radiology.

[13]  N. Suthanthira Vanitha,et al.  Computer A ided Detection of Tumours in Mammograms , 2014 .

[14]  Anne-Marie Dixon Diagnostic Breast Imaging: Mammography, Sonography, Magnetic Resonance Imaging, and Interventional Procedures, 3rd edition , 2014 .

[15]  Mats Lambe,et al.  Serum Calcium and the Risk of Breast Cancer: Findings from the Swedish AMORIS Study and a Meta-Analysis of Prospective Studies , 2016, International journal of molecular sciences.

[16]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.