An Evolutionary Ensemble-Based Method for Rule Extraction with Distributed Data

This paper presents a methodology for knowledge discovery from inherently distributed data without moving it from its original location, completely or partially, to other locations for legal or competition issues. It is based on a novel technique that performs in two stages: first, discovering the knowledge locally and second, merging the distributed knowledge acquired in every location in a common privacy aware maximizing the global accuracy by using evolutionary models. The knowledge obtained in this way improves the one achieved in the local stores, thus it is of interest for the concerned organizations.

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[3]  Stephen F. Smith,et al.  Competition-Based Induction of Decision Models from Examples , 2004, Machine Learning.

[4]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[5]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[6]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[7]  John H. Holland,et al.  COGNITIVE SYSTEMS BASED ON ADAPTIVE ALGORITHMS1 , 1978 .

[8]  Foster J. Provost,et al.  Scaling Up: Distributed Machine Learning with Cooperation , 1996, AAAI/IAAI, Vol. 1.

[9]  Larry J. Eshelman,et al.  The CHC Adaptive Search Algorithm: How to Have Safe Search When Engaging in Nontraditional Genetic Recombination , 1990, FOGA.

[10]  Antonio Peregrín,et al.  Efficient Distributed Genetic Algorithm for Rule extraction , 2011, Appl. Soft Comput..

[11]  Ricardo Vilalta,et al.  Metalearning - Applications to Data Mining , 2008, Cognitive Technologies.

[12]  Larry J. Eshelman The CHC Adaptive Search Algo-rithm , 1991 .

[13]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[14]  S. Smith,et al.  A Learning System Based on Genetic Algorithms , 1980 .

[15]  Stephen F. Smith,et al.  A learning system based on genetic adaptive algorithms , 1980 .

[16]  Gilles Venturini,et al.  SIA: A Supervised Inductive Algorithm with Genetic Search for Learning Attributes based Concepts , 1993, ECML.

[17]  Peter J. Fleming,et al.  Performance optimization of gas turbine engine , 2005, Eng. Appl. Artif. Intell..

[18]  Donald A. Waterman,et al.  Pattern-Directed Inference Systems , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Matthias Klusch,et al.  Distributed data mining and agents , 2005, Eng. Appl. Artif. Intell..

[20]  Filippo Neri,et al.  Search-Intensive Concept Induction , 1995, Evolutionary Computation.