A parallel genetic algorithm for rule mining

Rule mining consists of discovering valid and useful rules in large databases. As other data mining tasks, it is known to be time-consuming and I/O intensive. Evolutionary algorithms and parallelism are two important ways to deal with that performance problem. In this paper, we propose a parallel genetic algorithm for rule discovery, namely . We evaluated it on the Nursery School public domain data set available from the UCI Repository of Machine Learning databases. The results show that is efficient and allows to discover high quality rules.

[1]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[2]  Padhraic Smyth,et al.  Rule Induction Using Information Theory , 1991, Knowledge Discovery in Databases.

[3]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[4]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[5]  Ke Wang,et al.  Interestingness-Based Interval Merger for Numeric Association Rules , 1998, KDD.

[6]  H.S. Lopes,et al.  A parallel genetic algorithm for rule discovery in large databases , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[7]  Vipin Kumar,et al.  Scalable parallel data mining for association rules , 1997, SIGMOD '97.

[8]  Mohammed J. Zaki Parallel and distributed association mining: a survey , 1999, IEEE Concurr..

[9]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[10]  Alex Alves Freitas,et al.  On rule interestingness measures , 1999, Knowl. Based Syst..