ACME: An Associative Classifier Based on Maximum Entropy Principle

Recent studies in classification have proposed ways of exploiting the association rule mining paradigm. These studies have performed extensive experiments to show that their techniques are both efficient and accurate. However, existing studies in this paradigm either provide no theoretical justification for their approaches or assume independence between some parameters. In this work, we propose a new classifier based on association rule mining. Our classifier rests on the maximum entropy principle for its statistical basis and makes no independence assumptions that are not inferred from the given dataset. We use the classical generalized iterative scaling (GIS) algorithm to build our classification model. We show that GIS fails in some cases when itemsets are used as features, and we provide modifications that rectify this problem; the modified GIS also runs much faster than the original. We further describe techniques that make GIS tractable for large feature spaces: in particular, a new technique that divides a feature space into independent clusters, each of which can be handled separately. Our experimental results show that our classifier is generally more accurate than existing classification methods.
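To make the abstract's starting point concrete, the following is a minimal sketch of the classical GIS algorithm for a conditional maximum-entropy classifier over binary (itemset-style) features. It is a generic illustration, not the paper's modified algorithm: the toy data, the slack-feature construction, and the clamping of zero expectations are all assumptions made here for the sketch. The clamp also hints at one well-known difficulty with sparse itemset features (a feature that never co-occurs with a class has zero empirical expectation, so the multiplicative update is undefined), though this is not necessarily the exact failure case the paper addresses.

```python
# Minimal sketch of classical Generalized Iterative Scaling (GIS) for a
# conditional maximum-entropy classifier with binary features.
# Illustrative only; toy data and constants are hypothetical.
import numpy as np

# Toy training data: rows are binary feature vectors, y holds class labels.
X = np.array([[1, 0, 1],
              [1, 1, 0],
              [0, 1, 1],
              [1, 0, 1]], dtype=float)
y = np.array([0, 0, 1, 1])
classes = np.unique(y)

n, d = X.shape
k = len(classes)

# GIS requires every (x, y) pair to have the same total feature count C;
# a standard trick is to append a "slack" feature that pads each row to C.
C = X.sum(axis=1).max() + 1.0
slack = C - X.sum(axis=1)
Xp = np.hstack([X, slack[:, None]])   # (n, d+1); each row now sums to C

# One weight per (feature, class) for joint features f_{j,c}(x, y) = x_j * [y == c].
lam = np.zeros((d + 1, k))

# Empirical expectations E_emp[f_{j,c}] = (1/n) * sum of x_j over examples with y == c.
emp = np.zeros((d + 1, k))
for c in classes:
    emp[:, c] = Xp[y == c].sum(axis=0) / n

for _ in range(200):
    # Model distribution p(c | x) under the current weights (stabilized softmax).
    scores = Xp @ lam
    scores -= scores.max(axis=1, keepdims=True)
    p = np.exp(scores)
    p /= p.sum(axis=1, keepdims=True)

    # Model expectations E_model[f_{j,c}] = (1/n) * sum_x p(c | x) * x_j.
    model = (Xp.T @ p) / n

    # Classical GIS update: lambda += (1/C) * log(E_emp / E_model).
    # The clamp avoids log(0) when an itemset never fires with a class.
    lam += np.log(np.maximum(emp, 1e-12) / np.maximum(model, 1e-12)) / C

print(np.round(p, 3))  # class probabilities for the training points
```

The update is multiplicative in the exponential weights: each iteration scales a weight by how far the model's expectation of its feature is from the empirical expectation, damped by 1/C. Because C grows with the size of the largest feature set, long itemsets slow convergence, which is one reason modifications to GIS are attractive in this setting.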
