Gradient Learning Approach for Variable Selection in Credit Scoring

The number of variables used for credit scoring can be quite large, and selecting the most relevant variables becomes an important topic. In this paper, we use gradient learning method for variable selection in credit scoring. The original method in the literature does not work on credit datasets because of the large sample size. To conquer this, we modify the algorithm by resampling data and voting effective variables. Compared with traditional variable selection methods, our approach can handle nonlinear models.

[1]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[2]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[3]  Ian Witten,et al.  Data Mining , 2000 .

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  Jonathan N. Crook,et al.  Credit Scoring and Its Applications , 2002, SIAM monographs on mathematical modeling and computation.

[6]  Kin Keung Lai,et al.  Credit Risk Evaluation with Least Square Support Vector Machine , 2006, RSKT.

[7]  Sayan Mukherjee,et al.  Estimation of Gradients and Coordinate Covariation in Classification , 2006, J. Mach. Learn. Res..

[8]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[9]  Ralf Stecking,et al.  Support vector machines for classifying and describing credit applicants: detecting typical and critical regions , 2005, J. Oper. Res. Soc..

[10]  Mu-Chen Chen,et al.  Credit scoring with a data mining approach based on support vector machines , 2007, Expert Syst. Appl..

[11]  Sayan Mukherjee,et al.  Learning Coordinate Covariances via Gradients , 2006, J. Mach. Learn. Res..

[12]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[13]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[14]  Vijay S. Desai,et al.  A comparison of neural networks and linear scoring models in the credit union environment , 1996 .

[15]  David J. Hand,et al.  A survey of the issues in consumer credit modelling research , 2005, J. Oper. Res. Soc..

[16]  David J. Hand,et al.  Statistical Classification Methods in Consumer Credit Scoring: a Review , 1997 .