A Decision Tree Scoring Model Based on Genetic Algorithm and K-Means Algorithm

Credit scoring has been regarded as a critical topic and studied extensively in the finance field. Many artificial intelligence techniques have been used to solve credit scoring. The paper is to build a classification model based on a decision tree by learning historical data. Clustering algorithm and genetic algorithm are combined to further improve the accuracy of this credit scoring model. The clustering algorithm aims at removing noise data, while the genetic algorithm is used to reduce the redundancy attribute of data. The computational results on the two real world benchmark data sets show that the presented hybrid model is efficient.

[1]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[2]  F. Treacy,et al.  Credit Risk Rating at Large U , 1998 .

[3]  Huo-Wang Chen,et al.  A Modified Genetic Programming for Behavior Scoring Problem , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[4]  Mu-Chen Chen,et al.  Credit scoring and rejected instances reassigning through evolutionary computation techniques , 2003, Expert Syst. Appl..

[5]  Mu-Chen Chen,et al.  Credit scoring with a data mining approach based on support vector machines , 2007, Expert Syst. Appl..

[6]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[7]  C ONG,et al.  Building credit scoring models using genetic programming , 2005, Expert Syst. Appl..

[8]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[9]  Holger Frohlich,et al.  Feature Selection for Support Vector Machines by Means of Genetic Algorithms -Diploma Thesis in Computer Science- , 2002 .

[10]  Lijun Wei,et al.  Building Behavior Scoring Model Using Genetic Algorithm and Support Vector Machines , 2007, International Conference on Computational Science.

[11]  Tian-Shyug Lee,et al.  Mining the customer credit using classification and regression tree and multivariate adaptive regression splines , 2006, Comput. Stat. Data Anal..

[12]  Yi Jiang,et al.  A New Credit Scoring Method Based on Rough Sets and Decision Tree , 2008, PAKDD.

[13]  Nan-Chen Hsieh,et al.  Hybrid mining approach in the design of credit scoring models , 2005, Expert Syst. Appl..

[14]  Mark S. Carey,et al.  Credit risk rating systems at large US banks , 2000 .

[15]  Yi Jiang,et al.  A Comparison Study of Credit Scoring Models , 2007, Third International Conference on Natural Computation (ICNC 2007).

[16]  So Young Sohn,et al.  Cluster-based dynamic scoring model , 2007, Expert Syst. Appl..

[17]  Jianping Li,et al.  Support Vector Machines Approach to Credit Assessment , 2004, International Conference on Computational Science.

[18]  Kin Keung Lai,et al.  Neural Network Metalearning for Credit Scoring , 2006, ICIC.

[19]  Bernhard Schölkopf,et al.  Feature selection for support vector machines by means of genetic algorithm , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[20]  Alexander Gammerman,et al.  Machine-learning algorithms for credit-card applications , 1992 .

[21]  Bart Baesens,et al.  Decision Diagrams in Machine Learning: An Empirical Study on Real-Life Credit-Risk Data , 2004, Diagrams.