Uncertain decision tree for bank marketing classification

Abstract This study proposes a novel decision tree for uncertain data, called the uncertain decision tree (UDT), based on the uncertain genetic clustering algorithm (UGCA). UDT extends the decision tree to handle data with uncertain information, in which the uncertainty must be considered to obtain high quality results. In UDT, UGCA automatically searches for the proper number of branches of each node, based on the classification error rate and the classification time of UDT. Restated, UGCA reduces both the classification error rate and computing time and, then, optimizes the proposed UDT. Before the UDT is designed using UGCA, an uncertain merging algorithm (UMA) is also developed to reduce the uncertain data set, thereby allowing UGCA to process a large data set efficiently. Importantly, experimental results demonstrate that the proposed UDT outperforms traditional uncertain decision trees.

[1]  Xizhao Wang,et al.  A comparative study on heuristic algorithms for generating fuzzy decision trees , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[2]  Jason M. Daida,et al.  What Makes a Problem GP-Hard? Analysis of a Tunably Difficult Problem in Genetic Programming , 1999, Genetic Programming and Evolvable Machines.

[3]  Gonzalo Martínez-Muñoz,et al.  Using all data to generate decision tree ensembles , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[4]  Gilles Louppe,et al.  Understanding Random Forests: From Theory to Practice , 2014, 1407.7502.

[5]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[6]  Witold Pedrycz,et al.  C-fuzzy decision trees , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[7]  Ethem Alpaydin,et al.  Omnivariate decision trees , 2001, IEEE Trans. Neural Networks.

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Edward J. Delp,et al.  An Iterative Growing and Pruning Algorithm for Classification Tree Design , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[11]  Xizhao Wang,et al.  On the optimization of fuzzy decision trees , 2000, Fuzzy Sets Syst..

[12]  Antonio Nucci,et al.  Fuzzy-Clustering-Based Decision Tree Approach for Large Population Speaker Identification , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Sanjay Kumar Shukla,et al.  GA Guided Cluster Based Fuzzy Decision Tree for Reactive Ion Etching Modeling: A Data Mining Approach , 2012, IEEE Transactions on Semiconductor Manufacturing.

[14]  Witold Pedrycz,et al.  Designing decision trees with the use of fuzzy granulation , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[15]  Li Yi,et al.  A New Genetic Programming Algorithm for Building Decision Tree , 2011 .

[16]  Robin Genuer,et al.  Arbres CART et For{\^e}ts al{\'e}atoiresImportance et s{\'e}lection de variables , 2016, 1610.08203.

[17]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[18]  Jason M. Daida,et al.  What Makes a Problem GP-Hard? Validating a Hypothesis of Structural Causes , 2003, GECCO.

[19]  Sudha Ram,et al.  Constrained cascade generalization of decision trees , 2004, IEEE Transactions on Knowledge and Data Engineering.

[20]  Simon Kasif,et al.  A System for Induction of Oblique Decision Trees , 1994, J. Artif. Intell. Res..

[21]  J. Ross Quinlan,et al.  Learning logical definitions from relations , 1990, Machine Learning.

[22]  Paulo Cortez,et al.  A data-driven approach to predict the success of bank telemarketing , 2014, Decis. Support Syst..