A dynamic rule-induction method for classification in data mining

Rule induction (RI) produces classifiers containing simple yet effective ‘If–Then' rules for decision makers. RI algorithms normally based on PRISM suffer from a few drawbacks mainly related to rule pruning and rule-sharing items (attribute values) in the training data instances. In response to the above two issues, a new dynamic rule induction (DRI) method is proposed. Whenever a rule is produced and its related training data instances are discarded, DRI updates the frequency of attribute values that are used to make the next in-line rule to reflect the data deletion. Therefore, the attribute value frequencies are dynamically adjusted each time a rule is generated rather statically as in PRISM. This enables DRI to generate near perfect rules and realistic classifiers. Experimental results using different University of California Irvine data sets show competitive performance in regards to error rate and classifier size of DRI when compared to other RI algorithms.

[1]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[2]  Fadi A. Thabtah,et al.  Associative Classification Approaches: Review and Comparison , 2014, J. Inf. Knowl. Manag..

[3]  Fadi A. Thabtah,et al.  Mr-arm: a Map-Reduce Association Rule Mining Framework , 2013, Parallel Process. Lett..

[4]  Mo Adda,et al.  P-Prism: A Computationally Efficient Approach to Scaling up Classification Rule Induction , 2008, IFIP AI.

[5]  S. Ravi,et al.  Relevant association rule mining from medical dataset using new irrelevant rule elimination technique , 2013, 2013 International Conference on Information Communication and Embedded Systems (ICICES).

[6]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[7]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[8]  J. R. Quinlan,et al.  Data Mining Tools See5 and C5.0 , 2004 .

[9]  Fadi A. Thabtah,et al.  MAC: A Multiclass Associative Classification Algorithm , 2012, J. Inf. Knowl. Manag..

[10]  Marcel Abendroth,et al.  Data Mining Practical Machine Learning Tools And Techniques With Java Implementations , 2016 .

[11]  T. L. McCluskey,et al.  Predicting phishing websites based on self-structuring neural network , 2013, Neural Computing and Applications.

[12]  Jadzia Cendrowska,et al.  PRISM: An Algorithm for Inducing Modular Rules , 1987, Int. J. Man Mach. Stud..

[13]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[14]  Mary K. Coulter Strategic Management in Action , 1997 .

[15]  Peter I. Cowling,et al.  MMAC: a new multi-class, multi-label associative classification approach , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[16]  Fadi A. Thabtah,et al.  Parallel Associative Classification Data Mining Frameworks Based MapReduce , 2015, Parallel Process. Lett..

[17]  Fadi A. Thabtah,et al.  Phishing detection based Associative Classification data mining , 2014, Expert Syst. Appl..