Data mining with decision trees and decision rules

Abstract This paper describes the use of decision tree and rule induction in data-mining applications. Of methods for classification and regression that have been developed in the fields of pattern recognition, statistics, and machine learning, these are of particular interest for data mining since they utilize symbolic and interpretable representations. Symbolic solutions can provide a high degree of insight into the decision boundaries that exist in the data, and the logic underlying them. This aspect makes these predictive-mining techniques particularly attractive in commercial and industrial data-mining applications. We present here a synopsis of some major state-of-the-art tree and rule mining methodologies, as well as some recent advances.

[1]  Padhraic Smyth,et al.  An Information Theoretic Approach to Rule Induction from Databases , 1992, IEEE Trans. Knowl. Data Eng..

[2]  Se June Hong R-MINI: An Iterative Approach for Generating Minimal Rules from Examples , 1997, IEEE Trans. Knowl. Data Eng..

[3]  J. Rissanen Stochastic Complexity in Statistical Inquiry Theory , 1989 .

[4]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[5]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[6]  H. Scheffé,et al.  The Analysis of Variance , 1960 .

[7]  Usama M. Fayyad,et al.  Automating the Analysis and Cataloging of Sky Surveys , 1996, Advances in Knowledge Discovery and Data Mining.

[8]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[9]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[10]  Igor Kononenko,et al.  Attribute selection for modelling , 1997, Future Gener. Comput. Syst..

[11]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[12]  Sholom M. Weiss,et al.  Computer Systems That Learn , 1990 .

[13]  Madhu Sudan,et al.  A statistical perspective on data mining , 1997, Future Gener. Comput. Syst..

[14]  Mike James,et al.  Classification Algorithms , 1986, Encyclopedia of Machine Learning and Data Mining.

[15]  William W. Cohen Fast Eeective Rule Induction , 1995 .

[16]  Sholom M. Weiss,et al.  Rule-based Machine Learning Methods for Functional Prediction , 1995, J. Artif. Intell. Res..

[17]  Sholom M. Weiss,et al.  Optimized rule induction , 1993, IEEE Expert.

[18]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[19]  Chidanand Apté,et al.  Predicting Equity Returns from Securities Data with Minimal Rule Generation , 1994, KDD Workshop.

[20]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[21]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[22]  Wray L. Buntine,et al.  Learning classification trees , 1992 .

[23]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[24]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[25]  Jorma Rissanen,et al.  SLIQ: A Fast Scalable Classifier for Data Mining , 1996, EDBT.

[26]  Jude W. Shavlik,et al.  Using neural networks for data mining , 1997, Future Gener. Comput. Syst..

[27]  Sholom M. Weiss,et al.  Automated learning of decision rules for text categorization , 1994, TOIS.

[28]  J. Ross Quinlan,et al.  Combining Instance-Based and Model-Based Learning , 1993, ICML.