ATTRIBUTE SELECTION MEASURE IN DECISION TREE GROWING

Laviniu Aurelian Badulescu
University of Craiova, Faculty of Automation, Computers and Electronics, Software Engineering Department

Abstract: Classification is one of the major tasks in Data Mining. Growing Decision Trees from data is a very efficient technique for learning classifiers. The choice of the attribute used to split the data set at each Decision Tree node is fundamental to classifying objects correctly; a good choice improves the accuracy of the classification. In this paper, we study the behavior of Decision Trees induced with 14 attribute selection measures on three data sets taken from the UCI Machine Learning Repository.

Copyright © 2007 Laviniu Aurelian Badulescu. All rights reserved.
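To make the idea of an attribute selection measure concrete, the sketch below scores each attribute of a tiny data set and picks the one to split on. It uses information gain purely as an illustration; the abstract does not name the 14 measures studied, so this choice, the function names (`entropy`, `information_gain`, `best_attribute`), and the toy data are all assumptions and not the author's implementation.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Entropy reduction obtained by splitting on attribute index `attr`."""
    # Group the class labels by the value each row takes on this attribute.
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    n = len(labels)
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels):
    """Index of the attribute with the highest information gain."""
    return max(range(len(rows[0])),
               key=lambda a: information_gain(rows, labels, a))


# Hypothetical toy data: attribute 0 separates the classes, attribute 1 does not.
rows = [("sunny", "high"), ("sunny", "low"), ("rain", "high"), ("rain", "low")]
labels = ["no", "no", "yes", "yes"]

for a in range(len(rows[0])):
    print(a, information_gain(rows, labels, a))
print("split on attribute", best_attribute(rows, labels))
```

A tree grower would call something like `best_attribute` at every node and recurse on the resulting subsets; swapping in a different measure changes only the scoring function, which is the kind of comparison the paper describes.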
