Genetic Programming for the Automatic Construction of Features in Skin-Lesion Image Classification

This dissertation describes the design and implementation of a genetic programming system which automatically constructs feature equations for the classification of skin lesion images as a part of a real world dermatological image retrieval system. It uses generalized co-occurrence matrices (GCMs) and normal mathematical functions combined stochastically and evaluated using the feature selection techniques of fisher " s discriminant ratio, and the classification accuracy of either a bayes classifier or support vector machine. It deals with the notion of GP closure with " shell " functions and is able to arbitrarily combine information from different color channels, both unique designs compared with similar GP systems. Further, it can evolve features iteratively to complement each other. The implementation here is able to create features in small numbers which are able to classify better than most of the traditional set of Haralik features, even when the Haralik features are created with a greater number of GCM parameters. However, the system developed here does exhibit two notable problems for future work. The run-time is notably long and the amount of data collected in-house is not yet great enough to significantly measure the ability of the system to generalize. However, these problems are fixable and the work described has resulted in a system which aids classification relatively well and just as importantly, shows much potential.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Vladimir Vapnik Estimations of dependences based on statistical data , 1982 .

[3]  Gustavo Olague,et al.  Evolving Local Descriptor Operators through Genetic Programming , 2009, EvoWorkshops.

[4]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[5]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[6]  Sara Silva,et al.  GPLAB A Genetic Programming Toolbox for MATLAB , 2004 .

[7]  Cheng-Lung Huang,et al.  A GA-based feature selection and parameters optimizationfor support vector machines , 2006, Expert Syst. Appl..

[8]  Xiang Li,et al.  Content-Based Image Retrieval of Skin Lesions by Evolutionary Feature Synthesis , 2010, EvoApplications.

[9]  Xiang Li,et al.  A Query-by-Example Content-Based Image Retrieval System of Non-melanoma Skin Lesions , 2009, MCBR-CDS.

[10]  Antoine Geissbühler,et al.  A Review of Content{Based Image Retrieval Systems in Medical Applications { Clinical Bene(cid:12)ts and Future Directions , 2022 .

[11]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[12]  Philippe Schmid-Saugeona,et al.  Towards a computer-aided diagnosis system for pigmented skin lesions. , 2003, Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society.

[13]  R. Brereton,et al.  Support vector machines for classification and regression. , 2010, The Analyst.

[14]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[15]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[16]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[17]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[18]  Mario Giacobini Applications of Evolutinary Computing, EvoWorkshops 2007: EvoCoMnet, EvoFIN, EvoIASP,EvoINTERACTION, EvoMUSART, EvoSTOC and EvoTransLog, Valencia, Spain, April11-13, 2007, Proceedings , 2007, EvoWorkshops.

[19]  Abdesselam Bouzerdoum,et al.  The Evolutionary PreProcessor : Automatic FeatureExtraction for Supervised Classi cation using GeneticProgrammingJamie R , 1997 .

[20]  Bernhard Schölkopf,et al.  Feature selection for support vector machines by means of genetic algorithm , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[21]  T. Kanade,et al.  Color information for region segmentation , 1980 .

[22]  Randy H. Moss,et al.  A methodological approach to the classification of dermoscopy images , 2007, Comput. Medical Imaging Graph..

[23]  John R. Koza,et al.  Genetic programming: a paradigm for genetically breeding populations of computer programs to solve problems , 1990 .

[24]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[25]  Victor Ciesielski,et al.  Discovery of Human-Competitive Image Texture Feature Extraction Programs Using Genetic Programming , 2004, GECCO.

[26]  Michael Unser,et al.  Sum and Difference Histograms for Texture Classification , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Bipin C. Desai,et al.  Image Retrieval-Based Decision Support System for Dermatoscopic Images , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[28]  Ilias Maglogiannis,et al.  An integrated computer supported acquisition, handling, and characterization system for pigmented skin lesions in dermatological images , 2005, IEEE Transactions on Information Technology in Biomedicine.

[29]  Hiroshi Motoda,et al.  Book Review: Computational Methods of Feature Selection , 2007, The IEEE intelligent informatics bulletin.

[30]  Melanie Aurnhammer,et al.  Evolving Texture Features by Genetic Programming , 2009, EvoWorkshops.

[31]  Giordana Dell'Eva,et al.  Digital dermoscopy in clinical practise: a three‐centre analysis , 2007, Skin research and technology : official journal of International Society for Bioengineering and the Skin (ISBS) [and] International Society for Digital Imaging of Skin (ISDIS) [and] International Society for Skin Imaging.

[32]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .