Genetic Programming for Feature Selection and Feature Construction in Skin Cancer Image Classification

The incidence of skin cancer, particularly, malignant melanoma, continues to increase worldwide. If such a cancer is not treated at an early stage, it can be fatal. A computer system based on image processing and computer vision techniques, having good diagnostic ability, can provide a quantitative evaluation of these skin cancer cites called skin lesions. The size of a medical image is usually large and therefore requires reduction in dimensionality before being processed by a classification algorithm. Feature selection and construction are effective techniques in reducing the dimensionality while improving classification performance. This work develops a novel genetic programming (GP) based two-stage approach to feature selection and feature construction for skin cancer image classification. Local binary pattern is used to extract gray and colour features from the dermoscopy images. The results of our proposed method have shown that the GP selected and constructed features have promising ability to improve the performance of commonly used classification algorithms. In comparison with using the full set of available features, the GP selected and constructed features have shown significantly better or comparable performance in most cases. Furthermore, the analysis of the evolved feature sets demonstrates the insights of skin cancer properties and validates the feature selection ability of GP to distinguish between benign and malignant cancer images.

[1]  Pedro M. Ferreira,et al.  PH2 - A dermoscopic image database for research and benchmarking , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[2]  Omar Abuzaghleh,et al.  Noninvasive Real-Time Automated Skin Lesion Analysis System for Melanoma Early Detection and Prevention , 2015, IEEE Journal of Translational Engineering in Health and Medicine.

[3]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[4]  Mahadev Satyanarayanan,et al.  Computer-aided classification of melanocytic lesions using dermoscopic images. , 2015, Journal of the American Academy of Dermatology.

[5]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[6]  Paolo Carli,et al.  Addition of dermoscopy to conventional naked-eye examination in melanoma screening: a randomized study. , 2004, Journal of the American Academy of Dermatology.

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Stein Olav Skrøvseth,et al.  Performance of a dermoscopy-based computer vision system for the diagnosis of pigmented skin lesions compared with visual evaluation by experienced dermatologists , 2014, Artif. Intell. Medicine.

[9]  M. Emre Celebi,et al.  An Overview of Melanoma Detection in Dermoscopy Images Using Image Processing and Machine Learning , 2016, ArXiv.

[10]  Mengjie Zhang,et al.  Enhanced feature selection for biomarker discovery in LC-MS data using GP , 2013, 2013 IEEE Congress on Evolutionary Computation.

[11]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[12]  Eduardo Valle,et al.  Knowledge transfer for melanoma screening with deep learning , 2017, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017).

[13]  Mengjie Zhang,et al.  Genetic programming for skin cancer detection in dermoscopic images , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[14]  G. Argenziano,et al.  Epiluminescence microscopy for the diagnosis of doubtful melanocytic skin lesions. Comparison of the ABCD rule of dermatoscopy and a new 7-point checklist based on pattern analysis. , 1998, Archives of dermatology.

[15]  Josep Malvehy,et al.  An Atlas of Dermoscopy, Second Edition , 2012 .

[16]  Mengjie Zhang,et al.  A Filter Approach to Multiple Feature Construction for Symbolic Learning Classifiers Using Genetic Programming , 2012, IEEE Transactions on Evolutionary Computation.

[17]  Mengjie Zhang,et al.  Fitness Functions in Genetic Programming for Classification with Unbalanced Data , 2007, Australian Conference on Artificial Intelligence.

[18]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[19]  P. John Clarkson,et al.  An Integrated Model of Patient and Staff Satisfaction Using Queuing Theory , 2015, IEEE Journal of Translational Engineering in Health and Medicine.

[20]  Jorge S. Marques,et al.  Improving dermoscopy image analysis using color constancy , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[21]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.