A new approach for classification of patterns having categorical attributes

This paper proposes a novel approach for classification of patterns having categorical attributes using decision trees. A new split measure has been proposed for construction of decision trees. Main focus of the proposed split measure is to improve the classification accuracy. Performance of the proposed split measure has been compared with the well known split measure information gain used in ID3 algorithm. It has been shown that the proposed split measure outperforms information gain on benchmark datasets taken from UCI machine learning repository.

[1]  Ming Dong,et al.  Classifiability based omnivariate decision trees , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[2]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[3]  Carlo Zaniolo,et al.  CMP: a fast decision tree classifier using multivariate predictions , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[4]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[5]  John Mingers Inducing Rules for Expert Systems - Statistical Aspects , 1986 .

[6]  Steven Salzberg,et al.  Lookahead and Pathology in Decision Tree Induction , 1995, IJCAI.

[7]  Sati Mazumdar,et al.  Elegant decision tree algorithm for classification in data mining , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering (Workshops), 2002..

[8]  P. P. Chakrabarti,et al.  Improving Greedy Algorithms by Lookahead-Search , 1994, J. Algorithms.

[9]  B. Chandra,et al.  On Improving Efficiency of SLIQ Decision Tree Algorithm , 2007, 2007 International Joint Conference on Neural Networks.

[10]  Ravi Kothari,et al.  Look-ahead based fuzzy decision tree induction , 2001, IEEE Trans. Fuzzy Syst..

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Sanjay Ranka,et al.  CLOUDS: A Decision Tree Classifier for Large Datasets , 1998, KDD.

[13]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[14]  J. Ross Quinlan,et al.  Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[15]  Jorma Rissanen,et al.  SLIQ: A Fast Scalable Classifier for Data Mining , 1996, EDBT.

[16]  Kyuseok Shim,et al.  PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning , 1998, Data Mining and Knowledge Discovery.

[17]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[18]  Ravi Kothari,et al.  DECISION TREES FOR CLASSIFICATION: A REVIEW AND SOME NEW RESULTS , 2001 .

[19]  Tapio Elomaa,et al.  On the Well-Behavedness of Important Attribute Evaluation Functions , 1998, SCAI.