Discovering comprehensible classification rules with a genetic algorithm

This paper presents a classification algorithm based on genetic algorithms (GAs) that discovers comprehensible IF-THEN rules, in the spirit of data mining. The proposed GA uses a flexible chromosome encoding in which each chromosome corresponds to one classification rule. Although the number of genes (the genotype) is fixed, the number of rule conditions (the phenotype) is variable. The GA also has mutation operators designed specifically for this chromosome encoding. The algorithm was evaluated on two public-domain, real-world data sets from the medical domains of dermatology and breast cancer.
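The distinction between a fixed-length genotype and a variable-length phenotype can be illustrated with a minimal sketch. The abstract does not specify how the encoding achieves this, so the sketch assumes one plausible mechanism: each gene holds one candidate condition plus an on/off flag, and only flagged genes appear in the decoded rule. The `Gene` fields, the attribute names, the class label, and the `toggle_condition` operator are all hypothetical and are not taken from the paper.

```python
import operator
from dataclasses import dataclass, replace

# Comparison operators a condition may use (an assumed set, for illustration).
OPERATORS = {"==": operator.eq, "!=": operator.ne, ">=": operator.ge, "<": operator.lt}


@dataclass(frozen=True)
class Gene:
    """One candidate rule condition; `active` is an assumed on/off flag."""
    attribute: str
    op: str
    value: int
    active: bool


def active_conditions(chromosome):
    """Phenotype: only the genes whose flag is set take part in the rule."""
    return [g for g in chromosome if g.active]


def rule_text(chromosome, predicted_class):
    """Decode a chromosome into a readable IF-THEN rule."""
    conditions = active_conditions(chromosome)
    if not conditions:
        return f"IF TRUE THEN {predicted_class}"
    body = " AND ".join(f"{g.attribute} {g.op} {g.value}" for g in conditions)
    return f"IF {body} THEN {predicted_class}"


def covers(chromosome, example):
    """True if the example satisfies every active condition of the rule."""
    return all(
        OPERATORS[g.op](example[g.attribute], g.value)
        for g in active_conditions(chromosome)
    )


def toggle_condition(chromosome, index):
    """Hypothetical mutation: flip one gene's flag, leaving the gene count unchanged."""
    genes = list(chromosome)
    genes[index] = replace(genes[index], active=not genes[index].active)
    return tuple(genes)


# Three genes (fixed genotype length), two of them active (rule has two conditions).
chromosome = (
    Gene("age", ">=", 40, True),
    Gene("itching", "==", 2, False),
    Gene("scaling", "<", 3, True),
)
example = {"age": 45, "itching": 0, "scaling": 1}

print(rule_text(chromosome, "class_1"))
print(covers(chromosome, example))

mutated = toggle_condition(chromosome, 1)
print(rule_text(mutated, "class_1"))
```

Under this assumption, a mutation that flips a flag adds or removes a rule condition while the chromosome keeps the same number of genes, so standard fixed-length crossover would still apply.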
