Integrating Association Rules with Decision Trees in Object-Relational Databases

Research has provided evidence that associative classification produces more accurate results compared to other classification models. The Classification Based on Association (CBA) is one of the famous Associative Classification algorithms that generates accurate classifiers. However, current association classification algorithms reside external to databases, which reduces the flexibility of enterprise analytics systems. This paper implements the CBA in Oracle database using two variant models: hardcoding the CBA in Oracle Data Mining (ODM) package and Integrating Oracle Apriori model with the Oracle Decision tree model. We compared the proposed model performance with Naive Bayes, Support Vector Machine, Random Forests, and Decision Tree over 18 datasets from UCI. Results showed that our models outperformed the original CBA model with 1 percent and is competitive to chosen classification models over benchmark datasets.

[1]  Manish Agrawal,et al.  Impact of incorrect and new requirements on waterfall software project outcomes , 2018, Empirical Software Engineering.

[2]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[3]  Jirí Filip,et al.  Classification based on Associations (CBA) - A Performance Analysis , 2018, RuleML+RR.

[4]  Kamal Ali,et al.  Partial Classification Using Association Rules , 1997, KDD.

[5]  Elena Baralis,et al.  On support thresholds in associative classification , 2004, SAC '04.

[6]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[7]  Zulfiqar Ali,et al.  Empirical Study of Associative Classifiers on Imbalanced Datasets in KEEL , 2018, 2018 9th International Conference on Information, Intelligence, Systems and Applications (IISA).

[8]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[9]  Peter I. Cowling,et al.  MCAR: multi-class classification based on association rule , 2005, The 3rd ACS/IEEE International Conference onComputer Systems and Applications, 2005..

[10]  Susie Stephens,et al.  Oracle Data Mining , 2005 .

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[14]  R. Mike Cameron-Jones,et al.  FOIL: A Midterm Report , 1993, ECML.

[15]  Jaber Alwidian,et al.  WCBA: Weighted classification based on association rules algorithm for breast cancer disease , 2018, Appl. Soft Comput..

[16]  Osmar R. Zaïane,et al.  An associative classifier based on positive and negative rules , 2004, DMKD '04.

[17]  Shanlin Yang,et al.  CSMC: A combination strategy for multi-class classification based on multiple association rules , 2008, Knowl. Based Syst..

[18]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[19]  Wynne Hsu,et al.  Mining association rules with multiple minimum supports , 1999, KDD '99.

[20]  Elena Baralis,et al.  A lazy approach to pruning classification rules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[21]  “Alaa Al Deen,et al.  CLASSIFICATION BASED ON ASSOCIATION-RULE MINING TECHNIQUES : A GENERAL SURVEY AND EMPIRICAL COMPARATIVE EVALUATION , 2011 .

[22]  A Anitha,et al.  Predicting Dengue Using Fuzzy Association Rule Mining , 2019 .

[23]  Bin Liu,et al.  A novel software defect prediction based on atomic class-association rule mining , 2018, Expert Syst. Appl..

[24]  Xing Zhang,et al.  A new approach to classification based on association rule mining , 2006, Decis. Support Syst..

[25]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[26]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[27]  Geoffrey I. Webb,et al.  Advances in Knowledge Discovery and Data Mining , 2018, Lecture Notes in Computer Science.

[28]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[29]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[30]  Peter I. Cowling,et al.  MMAC: a new multi-class, multi-label associative classification approach , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[31]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[32]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[33]  Mohsen Attaran,et al.  Cloud Computing Technology:A viable Option for Smalland Medium-Sized Businesses , 2018 .

[34]  Sebastián Ventura,et al.  Evaluating associative classification algorithms for Big Data , 2019, Big Data Analytics.