Fuzzifying Gini Index based decision trees

Crisp decision tree algorithms face the problem of having sharp decision boundaries which may not be found in all real life classification problems. A fuzzy decision tree algorithm Gini Index based (G-FDT) is proposed in this paper to fuzzify the decision boundary without converting the numeric attributes into fuzzy linguistic terms. Gini Index is used as split measure for choosing the most appropriate splitting attribute at each node. The performance of G-FDT algorithm is compared with Gini Index based crisp decision tree algorithm (SLIQ) using several real life datasets taken from the UCI machine learning repository. G-FDT algorithm outperforms its crisp counterpart in terms of classification accuracy. The size of the G-FDT is significantly less compared to SLIQ.

[1]  Kyuseok Shim,et al.  PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning , 1998, Data Mining and Knowledge Discovery.

[2]  Sanjay Ranka,et al.  CLOUDS: A Decision Tree Classifier for Large Datasets , 1998, KDD.

[3]  M. Shaw,et al.  Induction of fuzzy decision trees , 1995 .

[4]  M. Fajfer,et al.  Fuzzy partitioning with FID3.1 , 1999, 18th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.99TH8397).

[5]  Xizhao Wang,et al.  Improving learning accuracy of fuzzy decision trees by hybrid neural networks , 2000, IEEE Trans. Fuzzy Syst..

[6]  Sati Mazumdar,et al.  Elegant decision tree algorithm for classification in data mining , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering (Workshops), 2002..

[7]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[8]  K. I. Ramachandran,et al.  Automatic rule learning using decision tree for fuzzy classifier in fault diagnosis of roller bearing , 2007 .

[9]  Jonathan Lawry,et al.  Decision tree learning with fuzzy labels , 2005, Inf. Sci..

[10]  Louis Wehenkel,et al.  A complete fuzzy decision tree technique , 2003, Fuzzy Sets Syst..

[11]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[12]  Malcolm James Beynon,et al.  The application of fuzzy decision tree analysis in an exposition of the antecedents of audit fees , 2004 .

[13]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[14]  B. Chandra,et al.  On Improving Efficiency of SLIQ Decision Tree Algorithm , 2007, 2007 International Joint Conference on Neural Networks.

[15]  Joung Woo Ryu,et al.  Efficient Fuzzy Rules For Classification , 2006, 2006 International Workshop on Integrating AI and Data Mining.

[16]  Shyi-Ming Chen,et al.  Generating fuzzy rules from training instances for fuzzy classification systems , 2008, Expert Syst. Appl..

[17]  S. Scheer,et al.  1. An overview , 1992 .

[18]  P.E. Maher,et al.  Uncertain reasoning in an ID3 machine learning framework , 1993, [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.

[19]  Salvatore Ruggieri,et al.  Efficient C4.5 , 2002, IEEE Trans. Knowl. Data Eng..

[20]  C.Z. Janikow,et al.  FID4.1: an overview , 2004, IEEE Annual Meeting of the Fuzzy Information, 2004. Processing NAFIPS '04..

[21]  Richard Weber,et al.  Fuzzy-ID3: A class of methods for automatic knowledge acquisition , 1992 .

[22]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[23]  E. Chong,et al.  Fuzzy modeling by induction and pruning of decision trees , 1994, Proceedings of 1994 9th IEEE International Symposium on Intelligent Control.

[24]  Jorma Rissanen,et al.  SLIQ: A Fast Scalable Classifier for Data Mining , 1996, EDBT.

[25]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[26]  J. Adamo Fuzzy decision trees , 1980 .

[27]  Chengming Qi A New Partition Criterion for Fuzzy Decision Tree Algorithm , 2007 .

[28]  Settimo Termini,et al.  A Definition of a Nonprobabilistic Entropy in the Setting of Fuzzy Sets Theory , 1972, Inf. Control..

[29]  I. Hatono,et al.  Fuzzy decision trees by fuzzy ID3 algorithm and its application to diagnosis systems , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[30]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[31]  C. M. Liu,et al.  Vague knowledge search in the design for outsourcing using fuzzy decision tree , 2007, Comput. Oper. Res..

[32]  G. Klir,et al.  MEASURES OF UNCERTAINTY AND INFORMATION BASED ON POSSIBILITY DISTRIBUTIONS , 1982 .

[33]  Emmanuel Roux,et al.  Linking clinical measurements and kinematic gait patterns of toe-walking using fuzzy decision trees. , 2007, Gait & posture.

[34]  Cezary Z. Janikow,et al.  Fuzzy decision trees: issues and methods , 1998, IEEE Trans. Syst. Man Cybern. Part B.