An evolutionary system for automatic explicit rule extraction

The search for novel and useful patterns within large databases, known as data mining, has become of great importance owing to the ever-increasing amounts of data collected by large organizations. In particular, the emphasis is on heuristic search methods which are able to discover patterns that are hard or impossible to detect using standard query mechanisms and classical statistical techniques. In this paper, an evolutionary system that is capable of extracting explicit classification rules is presented. The results are compared with those obtained by other approaches.

[1]  Rudy Setiono,et al.  Use of a quasi-Newton method in a feedforward neural network construction algorithm , 1995, IEEE Trans. Neural Networks.

[2]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[3]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[4]  Lutz Prechelt,et al.  A Set of Neural Network Benchmark Problems and Benchmarking Rules , 1994 .

[5]  Samir W. Mahfoud Niching methods for genetic algorithms , 1996 .

[6]  O. Mangasarian,et al.  Semi-Supervised Support Vector Machines for Unlabeled Data Classification , 2001 .

[7]  C.A. Pena-Reyes,et al.  Designing breast cancer diagnostic systems via a hybrid fuzzy-genetic methodology , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[8]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[9]  Heinz Mühlenbein,et al.  Strategy Adaption by Competing Subpopulations , 1994, PPSN.

[10]  O. Mangasarian,et al.  Robust linear programming discrimination of two linearly inseparable sets , 1992 .

[11]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[12]  Roger W. Whatmore,et al.  Sputtered lead scandium tantalate thin films: a microstructural study , 1998 .

[13]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[14]  W. N. Street,et al.  Image analysis and machine learning applied to breast cancer diagnosis and prognosis. , 1995, Analytical and quantitative cytology and histology.

[15]  Hongjun Lu,et al.  NeuroRule: A Connectionist Approach to Data Mining , 1995, VLDB.

[16]  Alex Alves Freitas,et al.  Discovering comprehensible classification rules by using Genetic Programming: a case study in a medical domain , 1999, GECCO.

[17]  Gilles Venturini,et al.  Learning First Order Logic Rules with a Genetic Algorithm , 1995, KDD.

[18]  Heinz Mühlenbein,et al.  Predictive Models for the Breeder Genetic Algorithm I. Continuous Parameter Optimization , 1993, Evolutionary Computation.

[19]  C. J. V. Rijsbergen,et al.  Rough Sets, Fuzzy Sets and Knowledge Discovery , 1994, Workshops in Computing.

[20]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[21]  Cosimo Anglano,et al.  A Network Genetic Algorithm for Concept Learning , 1997, ICGA.

[22]  Alex A. Freitas,et al.  Discovering interesting prediction rules with a genetic algorithm , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[23]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[24]  Michael Y. Hu,et al.  Estimating breast cancer risks using neural networks , 2002, J. Oper. Res. Soc..

[25]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[26]  Xin Yao,et al.  A new evolutionary system for evolving artificial neural networks , 1997, IEEE Trans. Neural Networks.

[27]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[28]  Changhwan Lee,et al.  A Multistrategy Approach to Classification Learning in Databases , 1999, Data Knowl. Eng..

[29]  David B. Fogel,et al.  Linear and neural models for classifying breast masses , 1998, IEEE Transactions on Medical Imaging.

[30]  Joydeep Ghosh,et al.  Evaluation and ordering of rules extracted from feedforward networks , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[31]  Hisao Ishibuchi,et al.  Selecting fuzzy if-then rules for classification problems using genetic algorithms , 1995, IEEE Trans. Fuzzy Syst..