Associative Classification with Prediction Confidence

Associative classification which uses association rules for classification has achieved high accuracy in comparison with other classification approaches. However, the confidence measure which is conventionally used for selecting association rules for classification may not conform to the prediction accuracy of the rules. In this paper, we propose a measure called prediction confidence to measure the prediction accuracy of association rules. In addition, a probabilistic-based approach for estimating prediction confidence of association rules is given and its performance is evaluated. The use of prediction confidence helps improve the performance of associative classifiers.

[1]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[2]  João Gama,et al.  Cascade Generalization , 2000, Machine Learning.

[3]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[4]  Peter A. Flach,et al.  Delegating classifiers , 2004, ICML.

[5]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[6]  Osmar R. Zaïane,et al.  Learning to Use a Learned Model: A Two-Stage Approach to Classification , 2006, Sixth International Conference on Data Mining (ICDM'06).

[7]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[8]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[9]  Peter A. Flach,et al.  Rule Evaluation Measures: A Unifying View , 1999, ILP.

[10]  Sanjay Chawla,et al.  CCCS: a top-down associative classifier for imbalanced class distribution , 2006, KDD '06.

[11]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[12]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[13]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[14]  Mohammed J. Zaki,et al.  Multi-evidence, multi-criteria, lazy associative document classification , 2006, CIKM '06.

[15]  Ron Kohavi,et al.  MLC++: a machine learning library in C++ , 1994, Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94.

[16]  Shlomo Argamon,et al.  Arbitrating Among Competing Classifiers Using Learned Referees , 2001, Knowledge and Information Systems.

[17]  Alexander K. Seewald,et al.  Exploring the parameter state space of stacking , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[18]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[19]  Howard J. Hamilton,et al.  Evaluation of Interestingness Measures for Ranking Discovered Knowledge , 2001, PAKDD.

[20]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[21]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[22]  Mykola Pechenizkiy,et al.  Dynamic Integration with Random Forests , 2006, ECML.

[23]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[24]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[25]  Bing Liu,et al.  Classification Using Association Rules: Weaknesses and Enhancements , 2001 .