A Genetic Programming Strategy to Induce Logical Rules for Clinical Data Analysis

This paper proposes a machine learning approach dealing with genetic programming to build classifiers through logical rule induction. In this context, we define and test a set of mutation operators across from different clinical datasets to improve the performance of the proposal for each dataset. The use of genetic programming for rule induction has generated interesting results in machine learning problems. Hence, genetic programming represents a flexible and powerful evolutionary technique for automatic generation of classifiers. Since logical rules disclose knowledge from the analyzed data, we use such knowledge to interpret the results and filter the most important features from clinical data as a process of knowledge discovery. The ultimate goal of this proposal is to provide the experts in the data domain with prior knowledge (as a guide) about the structure of the data and the rules found for each class, especially to track dichotomies and inequality. The results reached by our proposal on the involved datasets have been very promising when used in classification tasks and compared with other methods.

[1]  Stanislaw Osowski,et al.  Computerized system for recognition of autism on the basis of gene expression microarray data , 2015, Comput. Biol. Medicine.

[2]  Moshe Sipper,et al.  Evolutionary computation in medicine: an overview , 2000, Artif. Intell. Medicine.

[3]  Fernando Díaz,et al.  An evolutionary computational model applied to cluster analysis of DNA microarray data , 2013, Expert Syst. Appl..

[4]  Peter J. F. Lucas,et al.  Analysis of Notions of Diagnosis , 1998, Artif. Intell..

[5]  William F. Punch,et al.  Genetic programming for tuberculosis screening from raw X-ray images , 2018, GECCO.

[6]  Ali Ghaheri,et al.  The Applications of Genetic Algorithms in Medicine. , 2015, Oman medical journal.

[7]  Luis Muñoz,et al.  Evolving genetic programming classifiers with novelty search , 2016, Inf. Sci..

[8]  A Abu-Hanna,et al.  Prognostic methods in medicine. , 1999, Artificial intelligence in medicine.

[9]  Hitoshi Iba,et al.  Prediction of Cancer Class with Majority Voting Genetic Programming Classifier Using Gene Expression Data , 2009, TCBB.

[10]  Ghada Hany Badr,et al.  Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification , 2015, Comput. Biol. Chem..

[11]  Concha Bielza,et al.  Machine Learning in Bioinformatics , 2008, Encyclopedia of Database Systems.

[12]  Francisco Herrera,et al.  Genetics-Based Machine Learning for Rule Induction: State of the Art, Taxonomy, and Comparative Study , 2010, IEEE Transactions on Evolutionary Computation.

[13]  Hasan Ocak,et al.  A Medical Decision Support System Based on Support Vector Machines and the Genetic Algorithm for the Evaluation of Fetal Well-Being , 2013, Journal of Medical Systems.

[14]  Saud R. Alrshoud,et al.  Epileptic MEG Spike Detection Using Statistical Features and Genetic Programming with KNN , 2017, Journal of healthcare engineering.

[15]  Roohallah Alizadehsani,et al.  Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm , 2017, Comput. Methods Programs Biomed..

[16]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Zhiwen Yu,et al.  A survey on ensemble learning , 2019, Frontiers of Computer Science.

[18]  Andrzej Król,et al.  Evaluation of Novel Genetic Algorithm Generated Schemes for Positron Emission Tomography (PET)/Magnetic Resonance Imaging (MRI) Image Fusion , 2011, Journal of Digital Imaging.

[19]  Georgios Dounias,et al.  Evolving rule-based systems in two medical domains using genetic programming , 2004, Artif. Intell. Medicine.

[20]  Essam A. El-Kwae,et al.  Edge detection in medical images using a genetic algorithm , 1998, IEEE Transactions on Medical Imaging.

[21]  Francisco Herrera,et al.  A Survey on the Application of Genetic Programming to Classification , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  José Antonio Castellanos Garzón,et al.  A Gene Selection Approach based on Clustering for Classification Tasks in Colon Cancer , 2016 .

[23]  Chun-Gui Xu,et al.  A genetic programming-based approach to the classification of multiclass microarray datasets , 2009, Bioinform..

[24]  Chun-Jung Juan,et al.  Prediction Model of Cervical Spine Disease Established by Genetic Programming , 2017, MISNC '17.

[25]  Tsung-Chih Lin,et al.  Genetic algorithm-based interval type-2 fuzzy model identification for people with type-1 diabetes , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[26]  Suchendra M. Bhandarkar,et al.  An edge detection technique using genetic algorithm-based optimization , 1994, Pattern Recognit..

[27]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[28]  Vili Podgorelec,et al.  Knowledge discovery with classification rules in a cardiovascular dataset , 2005, Comput. Methods Programs Biomed..

[29]  M AlshamlanHala,et al.  Genetic Bee Colony (GBC) algorithm , 2015 .

[30]  Anselmo Cardoso de Paiva,et al.  Automatic detection of solitary lung nodules using quality threshold clustering, genetic algorithm and diversity index , 2014, Artif. Intell. Medicine.

[31]  Sung-Bae Cho,et al.  The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming , 2006, Artif. Intell. Medicine.