Genetic Programming for Multiple Feature Construction in Skin Cancer Image Classification

Skin cancer is a common cancer worldwide, with melanoma being the most deadly form which is treatable when diagnosed at an early stage. This study develops a novel classification approach using multi-tree genetic programming (GP), which not only targets melanoma detection but is also capable of distinguishing between ten different classes of skin cancer effectively from lesion images. Selecting a suitable feature extraction method and the way different types of features are combined are important aspects to achieve performance gains. Existing approaches remain unable to effectively design a way to combine various features. Moreover, they have not used multi-channel multi-resolution spatial/frequency information for effective feature construction. In this work, wavelet-based texture features from multiple color channels are employed which preserve all the local, global, color and texture information concurrently. Local Binary Pattern, lesion color variation, and geometrical border shape features are also extracted from various color channels. The performance of the proposed method is evaluated using two skin image datasets and compared with an existing multi-tree GP method, ten single-tree GP methods, and six commonly used classification algorithms. The results reveal the goodness of the proposed method which significantly outperformed all these classification methods and demonstrate the potential to help dermatologist in making a diagnosis in real-time situations.

[1]  M. N. Giri Prasad,et al.  Melanoma Is Skin Deep: A 3D Reconstruction Technique for Computerized Dermoscopic Skin Lesion Classification , 2017, IEEE Journal of Translational Engineering in Health and Medicine.

[2]  Nikhil R. Pal,et al.  A novel approach to design classifiers using genetic programming , 2004, IEEE Transactions on Evolutionary Computation.

[3]  J. Koza,et al.  A Genetic Programming Tutorial , 2003 .

[4]  Mengjie Zhang,et al.  Genetic programming for multiple-feature construction on high-dimensional classification , 2019, Pattern Recognit..

[5]  Pedro M. Ferreira,et al.  PH2 - A dermoscopic image database for research and benchmarking , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[6]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[7]  Ilias Maglogiannis,et al.  Overview of Advanced Computer Vision Systems for Skin Lesions Characterization , 2009, IEEE Transactions on Information Technology in Biomedicine.

[8]  M. Oltean,et al.  Multi Expression Programming , 2021 .

[9]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[10]  A. Jemal,et al.  Cancer statistics, 2019 , 2019, CA: a cancer journal for clinicians.

[11]  Bing Xue,et al.  Multi-tree Genetic Programming with A New Fitness Function for Melanoma Detection , 2019, 2019 IEEE Congress on Evolutionary Computation (CEC).

[12]  Bing Xue,et al.  A Multi-tree Genetic Programming Representation for Melanoma Detection Using Local and Global Features , 2018, Australasian Conference on Artificial Intelligence.

[13]  Robert B. Fisher,et al.  A Color and Texture Based Hierarchical K-NN Approach to the Classification of Non-melanoma Skin Lesions , 2013 .

[14]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[15]  James Bailey,et al.  Computer-Aided Diagnosis of Melanoma Using Border- and Wavelet-Based Texture Analysis , 2012, IEEE Transactions on Information Technology in Biomedicine.

[16]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[17]  Bing Xue,et al.  A Multitree Genetic Programming Representation for Automatically Evolving Texture Image Descriptors , 2017, SEAL.

[18]  Mengjie Zhang,et al.  Generating Redundant Features with Unsupervised Multi-Tree Genetic Programming , 2018, EuroGP.