Class association rule mining with correlation measures using genetic network programming

Association rule mining is one of the tasks of data mining and it has been extensively studied recently. As a consequence, several methods for extracting association rules have been developed during the last years. Most of them use the support and confidence framework to extract the association rules. Researches are able to extract strong rules using this framework. However these measures are not good enough to solve the quality problems of the rules. A new data mining method using Genetic Network Programming (GNP) has also been developed recently which uses the χf2 (chi-squared) as a correlation measure and its effectiveness has been shown for different datasets [1] [2]. To enhance the correlation degree and comprehensibility of association rule, several correlation measures including lift, χf2, all-confidence and cosine are studied in this paper when they are incorporated in the conventional GNP based mining algorithm. A comparison between the correlation measures is made in the simulations when they are incorporated separately into the GNP based mining method. Finally, the association rules extracted using different correlation measures are applied to the classification problems and the prediction accuracies of them are evaluated.

[1]  Shingo Mabu,et al.  A Graph-Based Evolutionary Algorithm: Genetic Network Programming (GNP) and Its Extension Using Reinforcement Learning , 2007, Evolutionary Computation.

[2]  Kotaro Hirasawa,et al.  Class Association Rule Mining with Chi-Squared Test Using Genetic Network Programming , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[3]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[4]  Geert Wets,et al.  Defining interestingness for association rules , 2003 .

[5]  Kotaro Hirasawa,et al.  A Double-Deck Elevator Group Supervisory Control System Using Genetic Network Programming , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[7]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[8]  Kotaro Hirasawa,et al.  Genetic Network Programming with Acquisition Mechanisms of Association Rules , 2006, J. Adv. Comput. Intell. Intell. Informatics.

[9]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[10]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[11]  Rajeev Motwani,et al.  Beyond Market Baskets: Generalizing Association Rules to Dependence Rules , 1998, Data Mining and Knowledge Discovery.

[12]  Kotaro Hirasawa,et al.  A study of evolutionary multiagent models based on symbiosis , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  William DuMouchel,et al.  Empirical bayes screening for multi-item associations , 2001, KDD '01.

[14]  Hajime Kimura,et al.  Unified motion planning method using random network and gradient method for multifunctional underwater robots , 2009, 2009 ICCAS-SICE.

[15]  Rajeev Motwani,et al.  Beyond market baskets: generalizing association rules to correlations , 1997, SIGMOD '97.

[16]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[17]  William DuMouchel,et al.  Bayesian Data Mining in Large Frequency Tables, with an Application to the FDA Spontaneous Reporting System , 1999 .

[18]  Philip S. Yu,et al.  A new framework for itemset generation , 1998, PODS '98.

[19]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .