FACISME: Fuzzy associative classification using iterative scaling and maximum entropy

All associative classifiers developed till now are crisp in nature, and thus use sharp partitioning to transform numerical attributes to binary ones like “Income = [100K and above]”. On the other hand, the novel fuzzy associative classification algorithm called FACISME, which we propose in this paper, uses fuzzy logic to convert numerical attributes to fuzzy attributes, like “Income = High”, thus maintaining the integrity of information conveyed by such numerical attributes. Moreover, FACISME is based on maximum entropy, and uses iterative scaling, both of which lend a very strong theoretical foundation to the algorithm. Entropy is one of the best measures of information, and maximum-entropy-based algorithms do not assume independence of parameters in the classification process. Thus, FACISME provides very good accuracy, and can work with all types of datasets (irrespective of size and type of attributes — numerical or binary) and domains.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[3]  Chris Cornelis,et al.  Elicitation of fuzzy association rules from positive and negative examples , 2005, Fuzzy Sets Syst..

[4]  Vikram Pudi,et al.  Fuzzy Logic-based Pre-processing for Fuzzy Association Rule Mining , 2008 .

[5]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[6]  I. Good Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables , 1963 .

[7]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[8]  Vikram Pudi,et al.  Fuzzy association rule mining algorithm for fast and efficient performance on very large datasets , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[9]  Mohammed J. Zaki Generating non-redundant association rules , 2000, KDD '00.

[10]  John D. Lafferty,et al.  Statistical Models for Text Segmentation , 1999, Machine Learning.

[11]  Eyke Hüllermeier,et al.  FURIA: an algorithm for unordered fuzzy rule induction , 2009, Data Mining and Knowledge Discovery.

[12]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[13]  Antonio González Muñoz,et al.  Table Ii Tc Pattern Recognition Result for 120 Eir Satellite Image Cases Selection of Relevant Features in a Fuzzy Genetic Learning Algorithm , 2001 .

[14]  Etienne E. Kerre,et al.  Computationally Efficient Mining for Fuzzy Implication-Based Association Rules in Quantitative Databases , 2004, Int. J. Gen. Syst..

[15]  Franz Pernkopf,et al.  Bayesian network classifiers versus selective k-NN classifier , 2005, Pattern Recognit..

[16]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[17]  Fadi A. Thabtah,et al.  A review of associative classification mining , 2007, The Knowledge Engineering Review.

[18]  Eyke Hüllermeier,et al.  FR3: A Fuzzy Rule Learner for Inducing Reliable Classifiers , 2009, IEEE Transactions on Fuzzy Systems.

[19]  Eyke Hüllermeier,et al.  A systematic approach to the assessment of fuzzy association rules , 2006, Data Mining and Knowledge Discovery.

[20]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[21]  Martine De Cock,et al.  Fuzzy versus quantitative association rules: a fair data-driven comparison , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Eyke Hüllermeier,et al.  Fuzzy methods in machine learning and data mining: Status and prospects , 2005, Fuzzy Sets Syst..

[23]  Dimitris Meretakis,et al.  Extending naïve Bayes classifiers using long itemsets , 1999, KDD '99.

[24]  Vikram Pudi,et al.  ACME: An Associative Classifier Based on Maximum Entropy Principle , 2005, ALT.

[25]  Antonio González Muñoz,et al.  SLAVE: a genetic learning system based on an iterative approach , 1999, IEEE Trans. Fuzzy Syst..

[26]  J. Darroch,et al.  Generalized Iterative Scaling for Log-Linear Models , 1972 .

[27]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.