Efficient Distributed Genetic Algorithm for Rule Extraction

This paper presents an efficient distributed genetic algorithm for classification rules extraction in data mining, which is based on a new method of dynamic data distribution applied to parallelism using networks of computers in order to mine large datasets. The presented algorithm shows many advantages when compared with other distributed algorithms proposed in the specific literature. In this way, some results are presented showing significant learning rate speed-up without compromising other features.

[1]  M. J. Quinn,et al.  Parallel Computing: Theory and Practice , 1994 .

[2]  Ian W. Flockhart GA-MINER : Parallel Data Mining with Hierarchical Genetic Algorithms Final Report , 1995 .

[3]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[4]  Francisco Herrera,et al.  A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability , 2009, Soft Comput..

[5]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[6]  Alex Alves Freitas,et al.  Mining Very Large Databases with Parallel Processing , 1997, The Kluwer International Series on Advances in Database Systems.

[7]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[8]  Stephen F. Smith,et al.  Competition-based induction of decision models from examples , 1993, Machine Learning.

[9]  Gilles Venturini,et al.  SIA: A Supervised Inductive Algorithm with Genetic Search for Learning Attributes based Concepts , 1993, ECML.

[10]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[11]  Hisao Ishibuchi,et al.  Data Set Subdivision for Parallel Distributed Implementation of Genetic Fuzzy Rule Selection , 2007, 2007 IEEE International Fuzzy Systems Conference.

[12]  Xian-He Sun,et al.  Limitations of Cycle Stealing for Parallel Processing on a Network of Homogeneous Workstations , 1997, J. Parallel Distributed Comput..

[13]  Jude W. Shavlik,et al.  Knowledge-Based Artificial Neural Networks , 1994, Artif. Intell..

[14]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[15]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[16]  Foster J. Provost,et al.  Distributed Machine Learning: Scaling Up with Coarse-grained Parallelism , 1994, ISMB.

[17]  John H. Holland,et al.  Cognitive systems based on adaptive algorithms , 1977, SGAR.

[18]  Gary M. Weiss Learning with Rare Cases and Small Disjuncts , 1995, ICML.

[19]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[20]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[21]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[22]  S. Smith,et al.  A Learning System Based on Genetic Algorithms , 1980 .

[23]  Cullen Schaffer When Does Overfitting Decrease Prediction Accuracy in Induced Decision Trees and Rule Sets? , 1991, EWSL.

[24]  K. De Jong,et al.  Using Genetic Algorithms for Concept Learning , 2004, Machine Learning.

[25]  Cosimo Anglano,et al.  NOW G-Net: learning classification programs on networks of workstations , 2002, IEEE Trans. Evol. Comput..

[26]  Xiaohui Wu,et al.  Implementation of a classification-based prediction model for plant mRNA Poly(A) sites , 2008, 2008 3rd International Conference on Bio-Inspired Computing: Theories and Applications.

[27]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[28]  N. J. Radcliffe,et al.  GA-MINER: Parallel Data Mining with Hierarchical Genetic Algorithms Final Report , 1995 .

[29]  Haym Hirsh,et al.  A Quantitative Study of Small Disjuncts , 2000, AAAI/IAAI.

[30]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[31]  Erick Cantú-Paz,et al.  Efficient and Accurate Parallel Genetic Algorithms , 2000, Genetic Algorithms and Evolutionary Computation.

[32]  Filippo Neri,et al.  Search-Intensive Concept Induction , 1995, Evolutionary Computation.