A Comparison of Methods for Rule Subset Selection Applied to Associative Classification

This paper presents Garss, a new algorithm for rule subset selection based on genetic algorithms, which uses the area under the ROC curve - AUC - as fitness function. Garss is a post-processing method that can be applied to any rule learning algorithm. In this work, Garss is analysed in the context of associative classification, where an association rule algorithm generates a set rules to be used as a classifier. An experimental evaluation was performed in order to analyse the behaviour of the proposed method. Results are compared with Roccer, a recently proposed algorithm for rule subset selection based on ROC analysis.

[1]  Johannes Fürnkranz,et al.  ROC ‘n’ Rule Learning—Towards a Better Understanding of Covering Algorithms , 2005, Machine Learning.

[2]  Miquel Barceló,et al.  Inteligencia Artificial , 2001 .

[3]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[4]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[5]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[6]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[7]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[8]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Applying Genetic and Symbolic Learning Algorithms to Extract Rules from Artificial Neural Networks , 2004, MICAI.

[9]  Christian Borgelt,et al.  Induction of Association Rules: Apriori Implementation , 2002, COMPSTAT.

[10]  Peter A. Flach,et al.  ROCCER: An Algorithm for Rule Learning Based on ROC Analysis , 2005, IJCAI.

[11]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[12]  Wagner Meira,et al.  Rule Generation and Rule Selection Techniques for Cost-Sensitive Associative Classification , 2005, SBBD.

[13]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[14]  Branko Kavsek,et al.  APRIORI-SD: ADAPTING ASSOCIATION RULE LEARNING TO SUBGROUP DISCOVERY , 2006, IDA.

[15]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[16]  Tom Fawcett,et al.  Robust Classification Systems for Imprecise Environments , 1998, AAAI/IAAI.

[17]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[18]  Pedro M. Domingos Unifying Instance-Based and Rule-Based Induction , 1996, Machine Learning.

[19]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[20]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[21]  Nada Lavrac,et al.  Classification Rule Learning with APRIORI-C , 2001, EPIA.

[22]  John A. Swets,et al.  Evaluation of diagnostic systems : methods from signal detection theory , 1982 .

[23]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[24]  D. E. Goldberg,et al.  Genetic Algorithms in Search, Optimization & Machine Learning , 1989 .

[25]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[26]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[27]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.