Genetic Programming for Rule Discovery

In subsection 5.4.4 we saw that standard Genetic Programming (GP) for symbolic regression — where all terminals are real-valued variables or constants and all functions have real-valued inputs and output — can be used for classification, if the numeric value output at the root of the tree is properly interpreted. However, this kind of GP does not produce high-level, comprehensible IF-THEN rules in the style of the rules discovered by rule induction and decision-tree building algorithms (chapter 3) and GAs for rule discovery (chapter 6). As discussed in subsection 1.1.1, rule comprehensibility is important whenever the discovered knowledge is used for decision making by a human user.

[1]  Jano I. van Hemert,et al.  A Comparison of Genetic Programming Variants for Data Classification , 1999, IDA.

[2]  Alex A. Freitas,et al.  Data Mining with Constrained-syntax Genetic Programming: Applications in Medical Data Sets , 2001 .

[3]  Jano I. van Hemert,et al.  Adapting the Fitness Function in GP for Data Mining , 1999, EuroGP.

[4]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[5]  Vladan Babovic,et al.  Genetic Programming, Ensemble Methods and the Bias/Variance Tradeoff - Introductory Investigations , 2000, EuroGP.

[6]  M. Keijzer,et al.  Dimensionally aware genetic programming , 1999 .

[7]  Rajkumar Roy,et al.  Advances in Soft Computing: Engineering Design and Manufacturing , 1998 .

[8]  Giandomenico Spezzano,et al.  Genetic Programming and Simulated Annealing: A Hybrid Method to Evolve Decision Trees , 2000, EuroGP.

[9]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[10]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[11]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[12]  A. Engelbrecht,et al.  Searching the forest: using decision trees as building blocks for evolutionary search in classification databases , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[13]  Alex A. Freitas,et al.  A Genetic Algorithm for Generalized Rule Induction , 1999 .

[14]  David J. Montana,et al.  Strongly Typed Genetic Programming , 1995, Evolutionary Computation.

[15]  Kumar Chellapilla,et al.  Data mining using genetic programming: the implications of parsimony on generalization error , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[16]  Alex Alves Freitas,et al.  Discovering comprehensible classification rules by using Genetic Programming: a case study in a medical domain , 1999, GECCO.

[17]  Vili Podgorelec,et al.  Fighting Program Bloat with the Fractal Complexity Measure , 2000, EuroGP.

[18]  Zhiwei Fu,et al.  An Innovative GA-Based Decision Tree Classifier in Large Scale Data Mining , 1999, PKDD.

[19]  Chandrika Kamath,et al.  Using Evolutionary Algorithms to Induce Oblique Decision Trees , 2000, GECCO.

[20]  William B. Langdon,et al.  Application of Genetic Programming to Induction of Linear Classification Trees , 2000, EuroGP.

[21]  Dimitrios Kalles,et al.  Breeding Decision Trees Using Evolutionary Techniques , 2001, ICML.

[22]  Celia C. Bojarczuk,et al.  Genetic programming for knowledge discovery in chest-pain diagnosis. , 2000, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[23]  Christoph F. Eick,et al.  MASSON: discovering commonalities in collection of objects using genetic programming , 1996 .

[24]  Kwong-Sak Leung,et al.  Data Mining Using Grammar Based Genetic Programming and Applications , 2000 .

[25]  Alex Alves Freitas,et al.  Discovering Fuzzy Classification Rules with Genetic Programming and Co-evolution , 2001, PKDD.

[26]  Michael J. Pazzani,et al.  Knowledge discovery from data? , 2000, IEEE Intell. Syst..