Identification of mammary lesions in thermographic images: feature selection study using genetic algorithms and particle swarm optimization

The incidence of breast cancer increases every year. Early detection of the disease is critical since the sooner the disease is discovered the better are the treatments and the chances of cure. Mammography is now the gold standard for the diagnosis of breast cancer, but this screening tool has some limitations. Infrared thermography is being studied as a complementary tool due to its benefits. The combination of specialized professionals with methods of digital image analysis in breast thermography can contribute to improve diagnosis performance. From this, several research groups have been proposing methods to treat these data. Feature selection plays a fundamental role in this process, since it may optimize the machine learning process. In this study, we propose a feature selection approach using genetic algorithms (GA) and particle swarm optimization (PSO) in thermographic images with breast lesions. The main goal of this approach is to optimize the identification and classification of breast lesions. We used several classifiers to assess the performance of the subsets with selected features. Support vector machines were more effective in these experiments. It was possible to reduce from 169 features with accuracy of 91.12% to 57 features with accuracy of 87.08% using the genetic algorithm. We also found a subset of 60 features with an accuracy of 86.16% using PSO. The features selection step could optimize classification time and computational cost. By varying input parameters, we could achieve a significant reduction in the amount of features. Features reduction could be done without losing much in accuracy, when compared to the whole set of features.

[1]  Y. Tavares,et al.  Utilização de Otimização por Enxame de Partículas e Algoritmos Genéticos em Rastreamento de Padrões , 2015 .

[2]  Abel G. Silva-Filho,et al.  A semi-supervised fuzzy GrowCut algorithm to segment and classify regions of interest of mammographic images , 2016, Expert Syst. Appl..

[3]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[4]  Samir Kumar Bandyopadhyay,et al.  Survey on Segmentation Methods for Locating Masses in a Mammogram Image , 2010 .

[5]  Humberto César Brandão de Oliveira,et al.  APLICAÇÃO DE UMA METODOLOGIA COMPUTACIONAL INTELIGENTE NO DIAGNÓSTICO DE LESÕES CANCERÍGENAS , 2011 .

[6]  Susmija Jabbireddy,et al.  Extraction of medically interpretable features for classification of malignancy in breast thermography , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[7]  Rita de Cássia Fernandes de Lima,et al.  Breast cancer diagnosis based on mammary thermography and extreme learning machines , 2018 .

[8]  Martin Styner,et al.  Objective Evaluation of Multiple Sclerosis Lesion Segmentation using a Data Management and Processing Infrastructure , 2018, Scientific Reports.

[9]  E. Yu,et al.  Infrared Imaging of the Breast: Initial Reappraisal Using High‐Resolution Digital Technology in 100 Successive Cases of Stage I and II Breast Cancer , 1998, The breast journal.

[10]  Julio Eduardo Arce,et al.  Use of Metaheuristics particle swarm optimization in forest planning. , 2012 .

[11]  Wellington Pinheiro dos Santos,et al.  Detection and classification of mammary lesions using artificial neural networks and morphological wavelets , 2018 .

[12]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[13]  Allan R. S. Feitosa,et al.  Reconstruction of electrical impedance tomography images using particle swarm optimization, genetic algorithms and non-blind search , 2014, 5th ISSNIP-IEEE Biosignals and Biorobotics Conference (2014): Biosignals and Robotics for Better and Safer Living (BRC).

[14]  Mantao Xu,et al.  Classification of Imbalanced Data by Using the SMOTE Algorithm and Locally Linear Embedding , 2006, 2006 8th international Conference on Signal Processing.

[15]  Wellington Pinheiro dos Santos,et al.  Detection and classification of masses in mammographic images in a multi-kernel approach , 2016, Comput. Methods Programs Biomed..

[16]  Anselmo Cardoso de Paiva,et al.  Automatic detection of solitary lung nodules using quality threshold clustering, genetic algorithm and diversity index , 2014, Artif. Intell. Medicine.

[17]  Abel G. Silva-Filho,et al.  Segmentation of Mammography by Applying Extreme Learning Machine in Tumor Detection , 2012, IDEAL.

[18]  Huei Diana Lee,et al.  Seleção de atributos importantes para a extração de conhecimento de bases de dados , 2005 .

[19]  Lance Chun Che Fung,et al.  Classification of Imbalanced Data by Combining the Complementary Neural Network and SMOTE Algorithm , 2010, ICONIP.

[20]  Allan R. S. Feitosa,et al.  Reconstruction of electrical impedance tomography images using genetic algorithms and non-blind search , 2014, 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI).

[21]  J. H. de Vasconcelos,et al.  Analysis of Methods of Classification of Breast Thermographic Images to Determine their Viability in the Early Breast Cancer Detection , 2018 .

[22]  Allan R. S. Feitosa,et al.  Reconstruction of electrical impedance tomography images using chaotic ring-topology particle swarm optimization and non-blind search , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[23]  P. Porter,et al.  "Westernizing" women's risks? Breast cancer in lower-income countries. , 2008, The New England journal of medicine.

[24]  George D. C. Cavalcanti,et al.  Mammographic images segmentation using texture descriptors , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[25]  Wellington Pinheiro dos Santos,et al.  Reconstruction of Electrical Impedance Tomography Using Fish School Search, Non-Blind Search, and Genetic Algorithm , 2017, Int. J. Swarm Intell. Res..

[26]  Abel G. Silva-Filho,et al.  Segmentation of Mammography by Applying GrowCut for Mass Detection , 2013, MedInfo.

[27]  Aura Conci,et al.  Breast thermography from an image processing viewpoint: A survey , 2013, Signal Process..

[28]  Wellington Pinheiro dos Santos,et al.  Fuzzy Morphological Extreme Learning Machines to detect and classify masses in mammograms , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[29]  Maurício Marengoni,et al.  Applying Artificial Neural Network for the Classification of Breast Cancer Using Infrared Thermographic Images , 2016, ICCVG.

[30]  Adriano Honorato de Souza,et al.  Seleção de atributos relevantes: aplicando técnicas na base de dados do Herbário Virtual da Flora e dos Fungos , 2017 .

[31]  A. Stein,et al.  Guidelines for early detection of breast cancer in Brazil. I - Development methods. , 2018, Cadernos de saude publica.